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CONTENT ADDRESSABLE MEMORY DEVICE 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application is a continuation of U.S. patent application no. 10/062,307 filed 
February 1, 2001 and entitled "Content Addressable Memory Device." U.S. patent application 
no. 10/062,307 is hereby incorporated by reference in its entirety. 

FIELD OF THE INVENTION 

[0002] The present invention relates generally to information retrieval systems and more 
particularly to content addressable memory devices. 

BACKGROUND 

[0003] Content addressable memory (CAM) devices are often used to support packet 
forwarding and classification operations in network switches and routers. A CAM device can be 
instructed to compare a search value, typically formed from one or more fields within the header 
of an incoming packet, with entries within an associative storage array of the CAM device. If 
the search value matches an entry, the CAM device generates an index that corresponds to the 
location of the matching entry within the storage array, and asserts a match flag to signal the , 
match. The index may then be used to address another storage array, either within or separate 
from the CAM device, to retrieve routing or classification information for the packet. 
[0004] Figure 1 illustrates a prior art CAM device 100 that includes a CAM array 101 coupled 
to match logic 103 via a plurality of match lines 105 (ML). During a search operation, a search 
key is compared with the contents of each row of CAM cells 107 within the CAM array 101 to 
generate a corresponding row match signal. Each row match signal is either asserted or 
deasserted to indicate a match or mismatch condition, and is output to the match logic 103 via a 
respective one of the match lines 105. The match logic 103 responds to an asserted match signal 
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by generating a match address 1 14 (MA) that corresponds to the row of CAM cells 107 that 
signaled the match, and by asserting a match flag 116 (MF) to indicate the match detection. 
[0005] Referring to Figure 2, each CAM cell 107 within the CAM array of Figure 1 includes 
both a storage element 123 and a compare circuit 125. During a search operation, a data bit 
within the storage element is compared to a corresponding bit (B S k) of the search key. If the bits 
do not match, compare circuit 125 pulls match line 105 low to indicate the mismatch. By 
integrating a storage element and compare circuit within each CAM cell 107 of the CAM array 
in this manner, a search key may be simultaneously compared with the contents of each row of 
CAM cells, thereby providing a massively parallel, and therefore extremely rapid search. 
[0006] Although the integration of compare and storage circuits within each CAM cell 107 
enables simultaneous, multi-row searching within the CAM array, the additional transistors 
required to implement the compare circuit significantly increases the size of the cell 107, 
reducing the memory density that can be achieved within the CAM device. Also, because each 
compare circuit in each row of CAM cells is simultaneously activated during a search operation, 
a relatively large amount of power is required to perform a search. This power consumption 
results in heat generation that further limits the storage density that can be achieved within the 
CAM device (i.e., due to thermal constraints). 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0007] The features and advantages of the present invention are illustrated by way of example 
and are by no means intended to limit the scope of the present invention to the particular 
embodiments shown, and in which: 

Figure 1 illustrates a prior art CAM device; 

Figure 2 illustrates a prior art CAM cell; 

Figure 3 illustrates a hash CAM device according to an embodiment of the present 
invention; 

Figure 4 is a flow diagram for a NFA operation within the hash CAM device of Figure 3; 

Figure 5 illustrates a search operation within the hash CAM device of Figure 3; 

Figure 6 illustrates an embodiment of a hash CAM device in greater detail; 

Figure 7 illustrates an embodiment of the assembler circuit, key mask logic and hash 
index generator of Figure 6; 

Figure 8 illustrates address selection logic which may be included within the hash CAM 
device of Figure 6; 

Figure 9 illustrates an embodiment of the compare logic of Figure 6; 

Figure 10 illustrates an embodiment of the multi-mode comparator of Figure 9; 

Figure 1 1 illustrates the structure of a component comparator according to one 
embodiment; 

Figure 12 illustrates the structure of a component comparator according to another 
embodiment; 

Figure 13 illustrates a compare cell according to one embodiment; 
Figure 14 illustrates a CAM device that includes a hash CAM block and an overflow 
CAM block; 
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Figure 15 illustrates an embodiment of a CAM device that includes multiple hash CAM 
blocks and an overflow CAM block; 

Figure 16 illustrates the structure of a hash CAM block that may be used within the hash 
CAM device of Figure 15; 

Figure 17 illustrates an embodiment of the block select logic of Figure 16; 

Figure 18 illustrates an embodiment of the output logic of Figure 16; 

Figure 19 illustrates an embodiment of the device priority logic of Figure 15; 

Figure 20 is a flow diagram of a NFA operation within the hash CAM device of Figure 

15; 

Figure 21 is a flow diagram of a NFA operation within the hash CAM block of Figure 16; 

Figure 22 illustrates an insert operation within the hash CAM device of Figure 15; 

Figure 23 illustrates a search operation within the hash CAM device of Figure 15; 

Figure 24 illustrates a search operation within a hash CAM block of Figure 15; 

Figure 25 illustrates a NFA operation within an exemplary CAM device having four hash 
CAM blocks, each of which outputs a fixed insert priority value; 

Figure 26 illustrates an exemplary NFA operation within the hash CAM device of Figure 
25 after a number of entries have been loaded; 

Figures 27 and 28 illustrate NFA operations within a hash CAM device that implements a 
capacity-dependent insertion policy; 

Figure 29 illustrates an alternative embodiment of a CAM device that includes multiple 
hash CAM blocks; 

Figure 30 illustrates a more detailed embodiment of the assembler/mask circuit and hash 
index generator of Figure 29; 

Figure 3 1 illustrates an embodiment of a hash CAM block that includes a segmented 
memory; 
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Figure 32 illustrates an embodiment of the output logic of Figure 3 1 ; 

Figure 33 illustrates an embodiment of the flag logic of Figure 32; 

Figure 34 illustrates an embodiment of the segment priority logic of Figure 32; 

Figure 35 illustrates alternative NFA priority logic that may be used in place of the NFA 
priority logic of Figure 34; 

Figure 36 illustrates an embodiment of the segment index encoder of Figure 32; 

Figure 37 illustrates a NFA operation within an exemplary CAM device having four hash 
CAM blocks, each of which includes a four-segment memory, and each of which outputs one of 
two insert priority values; 

Figure 38 illustrates another exemplary NFA operation within the hash CAM device of 
Figure 37 after additional" entries have been loaded; 

Figures 39 and 40 illustrate NFA operations within a hash CAM device that includes the 
NFA priority logic of Figure 35; 

Figure 41 illustrates another embodiment of a hash CAM block; 

Figure 42 illustrates the different results produced by direct hashing and indirect hashing; 
Figure 43 illustrates an embodiment of the binary CAM and the address selector of 
Figure 41; 

Figure 44 illustrates an embodiment of the output logic of Figure 41 ; 

Figure 45 illustrates the flag logic of Figure 44 according to one embodiment; 

Figure 46 illustrates a NFA priority logic embodiment that may be included within the 
segment priority logic of Figure 44; 

Figure 47 illustrates an embodiment of the segment index encoder of Figure 44; 

Figure 48 is a flow diagram of a NFA operation within a hash CAM device that includes 
multiple hash CAM blocks each according to Figure 41 ; 
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Figure 49 is a flow diagram of a NFA operation within the hash CAM block of Figure 41 ; 

and 

Figure 50 is a flow diagram of an insert operation within hash CAM device that includes 
multiple hash CAM blocks each according to Figure 41; 

Figure 51 is a flow diagram of a search operation within the hash CAM block of Figure 

41. 

Figure 52 illustrates an embodiment of an overflow CAM block having a programmable 
data storage width and a programmable priority function; 

Figure 53 illustrates an embodiment of the priority index table of Figure 52; 

Figure 54 shows a priority index table that is one embodiment of two priority storage 
circuits of the priority array of Figure 53; 

Figure 55 illustrates an exemplary operation of the priority index table of Figure 54; 

Figure 56 shows one embodiment of a priority cell for implementing the truth table of 
Table 2; 

Figure 57 illustrates an alternate embodiment of a priority cell for implementing the truth 
table of Table 2; 

Figure 58 illustrates waveforms of enable signals that may be used within a priority index 
table having priority cells according to Figure 57; 

Figure 59 illustrates an embodiment of the tag logic of Figure 53; 

Figure 60 illustrates an embodiment of the row logic of Figure 53; 

Figure 61 illustrates an embodiment of the priority encoder 805 of Figure 52; 

Figure 62 illustrates the format of a NFA mask according to one embodiment; 

Figure 63 illustrates an embodiment of the priority select logic of Figure 52; 

Figure 64 illustrates an embodiment of the index select logic of Figure 52; and 
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Figure 65 illustrates a system that includes a CAM device according to an embodiment of 
the present invention. 
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DETAILED DESCRIPTION 

[0008] In the following description, for purposes of explanation, specific nomenclature is set 
forth to provide a thorough understanding of the present invention. However, it will be apparent 
to one skilled in the art that these specific details may not be required to practice the present 
invention. In some instances, the interconnection between circuit elements or blocks may be 
shown as buses or as single signal lines. Each of the buses may alternatively be single signal 
lines, and each of the single signal lines may alternatively be buses. A signal is said to be 
"asserted" when the signal is driven to a low or high logic state (or charged to a high logic state 
or discharged to a low logic state) to indicate a particular condition. Conversely, a signal is said 
to be "deasserted" to indicate that the signal is driven (or charged or discharged) to a state other 
than the asserted state (including a high or low logic state, or the floating state that may occur 
when the signal driving circuit is transitioned to a high impedance condition, such as an open 
drain or open collector condition). A signal driving circuit is said to "output" a signal to a signal 
receiving circuit when the signal driving circuit asserts (or deasserts, if explicitly stated or 
indicated by context) the signal on a signal line coupled between the signal driving and signal 
receiving circuits. A signal line is said to be "activated" when a signal is asserted on the signal 
line, and "deactivated" when the signal is deasserted. Additionally, the prefix symbol "/" 
attached to signal names indicates that the signal is an active low signal (i.e., the asserted state is 

a logic low state). A line over a signal name (e.g., ' < signal name > ') is also used to indicate an 
active low signal. Active low signals may be changed to active high signals and vice-versa as is 
generally known in the art. 

[0009] A hash CAM device that includes a memory and hash circuitry to associate a search 
value with a unique location within the memory is disclosed in numerous embodiments. In 
contrast to the prior art CAM device described above, a search value is not compared with each 
entry within an array of CAM cells to locate a matching entry, but rather the search value is input 
Atty. Docket No. NLMI.P1 12C - 9 - 



to a hash index generator which generates a search index (i.e., address) to the memory based on 
the search value itself. Each value stored within the memory was similarly input to the hash 
index generator to generate a storage index (i.e., memory address at which the value is stored) so 
that, if the search value matches a previously stored value, the search index and storage index 
will match. Accordingly, a match may be detected by comparing the search value with the entry 
stored at the search index. If the entry and search value match, a match flag is asserted to 
indicate the match detection and the search index is output as the match address. Because 
compare circuits need not be included within each storage cell of the device memory, a 
significantly higher memory density may be achieved in the hash CAM device than in the prior 
art CAM device described above. Also, because a search value is compared with only one 
memory entry per search (as opposed to comparing a search value with all entries within a CAM 
array), significantly less power is required to search the hash CAM device than the prior art 
CAM device described above. 

Overview of a Hash CAM device 
[0010] Figure 3 illustrates a hash CAM device 130 according to an embodiment of the present 
invention. The hash CAM device 130 includes a hash index generator 131, memory 133, 
compare logic 135 and, optionally, a configuration register 137. A data bus 140 is coupled to the 
hash index generator 131 and to the compare logic 135 to provide input data values during load 
operations and search operations. The hash CAM device 130 may also include an instruction 
decoder (not shown) to generate timing and control signals in response to instructions from a 
host device, such as a network processor, general purpose processor, application specific 
integrated circuit (ASIC) or any other instruction-issuing device. 

[0011] Each data value stored in the memory 133 is referred to herein as an entry (shown 
generally at 132) and includes at least a key, KEY, and validity value, V. The key is formed 
from selected bits within the input data value and is input, at least in part, to the hash index 
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generator 131 to generate the hash index at which the entry is stored (i.e., the storage index) . 
The validity value may include one or more bits and is used to indicate the presence of a valid 
entry. An entry 132 may optionally include mask information, MSK (e.g., a mask value to be 
applied during a comparison operation, or an encoded value that can be decoded to generate a 
mask value), a priority value, PRI, which is a numeric value that indicates a priority of the entry 
relative to other entries, and control/configuration information, CNTRL/CFG, that may be used 
to control certain operations within the hash CAM device 130. The priority values included 
within the various entries within memory may be assigned in ascending priority order (i.e., lower 
numeric values indicate a higher priority than higher numeric values) or descending priority 
order (i.e., higher numeric values indicate a higher priority than lower numeric values). In the 
description that follows ascending priority order is generally assumed, though descending 
priority order may be alternatively be used. Also, in alternative embodiments, non-numeric 
values may be used to indicate priority (e.g., codewords or other indications of priority may be 
used). 

Loading and Searching the Hash CAM 
[0012] The hash index generator 1 3 1 ideally generates a unique storage index for each value to 
be loaded into the memory 133. For most applications, however, it is desirable for the hash 
index to be smaller (i.e., contain fewer constituent bits) than the corresponding input value, 
meaning that two or more different input values may yield the same storage index; an event 
referred to herein as a collision. Accordingly, before loading an input value into the memory 
133, it is desirable to determine whether the input value will yield an index to an occupied or 
unoccupied storage location within memory 133 (i.e., determine whether a collision will occur). 
Otherwise, a previously loaded entry may be unknowingly overwritten. In one embodiment, a 
collision determination is achieved through a special type of search operation referred to herein 
as a next free address (NFA) operation. In a NFA operation, the value to be stored, referred to 
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herein as an entry candidate, is provided in whole or part to the hash index generator 131 to 
produce a hash index 136 . The entry, if any, at the indexed location (i.e., the storage location 
within memory 133 indicated by the hash index) is output to the compare logic 135 which, in 
turn, asserts or deasserts the match flag 134 according to the state of the validity value of the 
entry. If the validity value indicates the presence of a valid entry, a collision between the entry 
candidate and an existing entry (i.e., a previously loaded entry) has occurred, and the compare 
logic 135 deasserts the match flag 134 to indicate the conflicting occupation of the indexed 
memory location and that the candidate input value cannot be stored. If the validity value does 
not indicate the presence of a valid entry, the compare logic 135 asserts the match flag 134 to 
indicate that the indexed memory location is unoccupied. The candidate input value is 
subsequently stored in the memory 133 at the indexed location in an operation referred to herein 
as an insert operation. 

[0013] During a search operation, a search value received via the data bus is input to the hash 
index generator to generate a hash index 136 referred to herein as a search index. The indexed 
entry 138 (i.e., entry at the location indicated by the search index) is output to the compare logic 
135 which compares the entry with the search value. If the entry matches the search value, the 
match flag 134 is asserted to indicate the match condition. The search index (i.e., hash index 
136) is output as the match address and may be used to access an associated storage 1 1 1, for 
example, to retrieve forwarding, classification, policing or other information associated with the 
entry. If the entry does not match the search value, the match flag 134 is deasserted to indicate 
that no match was detected within the hash CAM device 130 and, for example, may be used to 
qualify the read from the associated storage 111. 

[0014] Figure 4 is a flow diagram for a NFA operation within the hash CAM device of Figure 
3. Initially, at 151, a hash index is generated based on an incoming data value. At 153, memory 
is read at a location indicated by the hash index. If the location is determined to be occupied at 
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155 (e.g., based on the validity value at the indexed location), the hash CAM device is unable to 
store the incoming value and the match flag signal is deasserted at 157. If the location is 
determined to be unoccupied at 1 55, the hash CAM device asserts the match flag and outputs the 
hash index as the next free address at 159. The data value may then be stored in the memory 
location indicated by hash index in a subsequent insert operation. 

[0015] Figure 5 illustrates a search operation within the hash CAM device of Figure 3. At 171, 
a hash index is generated based on an incoming search value. At 173, an entry is read from the 
memory at a location indicated by the hash index. The entry is compared with the search value 
at 175. If the entry matches the search value, then at 179 the match flag is asserted to indicate 
the match detection, and the hash index is output as the match address. If the entry does not 
match the search value, the match flag is deasserted at 1 77. 

[0016] Figure 6 illustrates an embodiment of a hash CAM device 190. The hash CAM device 
190 includes an assembler circuit 191, key mask logic 193, hash index generator 131, memory 
133, compare logic 135, configuration register 137, and instruction decoder 203. The instruction 
decoder 203 is coupled to an instruction bus 200 to receive instructions from a host device (e.g., 
network processor, general purpose processor, application specific integrated circuit (ASIC) or 
any other instruction issuing device), and outputs control and timing signals to other circuit 
blocks within hash CAM device 190 to carry out the instructed operations. In one embodiment, 
the primary operations carried out in response to the incoming instructions are NFA, insert and 
search operations, though numerous other operations may be performed including, without 
limitation, memory read operations, memory write operations directed to explicitly provided 
addresses, test operations and so forth. 

[0017] In one embodiment, the assembler circuit 191 assembles the key, mask information, 
priority and/or control/configuration values (referred to herein as entry components) from 
potentially dispersed fields of bits within an incoming data value. An entry type value 
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programmed within the configuration register 137 indicates the location of the bit fields within 
the incoming data value and is provided to the assembler circuit 191 to enable the assembler 
circuit to properly assemble the entry components of differently formatted data types (e.g., IPv4. 
(Internet Protocol version 4), IPv6 (Internet Protocol version 6), MPLS (Multiprotocol Label 
Switching), etc.). The complete set of assembled entry components constitutes an entry 208 and 
is output from the assembler circuit to a read/write circuit 197 within the memory 133 for storage 
in an insert operation. The assembler circuit 191 also outputs the assembled key 192 to the key 
mask logic 193, and to the compare logic 135. The key provided to the compare logic 135 is 
referred to herein as a search key 210. Note that, in one embodiment, the key 192 and search key 
210 are the same assembled portion of an incoming value, but are nonetheless referred to herein 
by different names and reference numerals. 

[0018] The key mask logic 193 receives the key 192 from the assembler circuit 191 and masks 
selected portions of the key according to a key mask value programmed within the configuration 
register 137. The resulting masked key 194 is provided to the hash index generator 131. The 
hash index generator 131 generates a hash index 136 based on the masked key 134. As 
discussed above, the hash index is used during NFA and search operations to access a location 
within the memory 133. 

[0019] Figure 7 illustrates an embodiment of the assembler circuit 191, key mask logic 193 
and hash index generator 131 of Figure 6 (other embodiments may be used). Referring first to 
the assembler circuit 191, bit fields Fi-F z of an incoming data value are assembled according to 
the programmed entry type value to produce a key having constituent bit fields Ki-K x , mask 
information having constituent bit fields Mi-M w , and a priority value having constituent bit 
fields Pi-Py. As mentioned above, the mask information and/or priority value may be omitted 
depending upon the entry type of the incoming data value. Also, other entry components such as 
a control/configuration value may be assembled in alternative embodiments. The complete set of 
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entry components form entry 208 and is output to the read/write circuit, while the key 192 is 
output to the key mask logic and also, as search key 210, to the compare logic. Note that the 
constituent bit fields Fi-Fz of the incoming data value are not necessarily all the same size. 
Similarly the constituent bit fields of the key, mask, and priority value may be differently sized. 
[0020] In one embodiment, Q equally sized bit fields, K F i-K F q, within the key 192 are 
selectively masked by the key mask logic 193 according to a key mask value stored in the 
configuration register. In one embodiment, for example, each of the Q fields is a 4-bit field that 
is selectively masked (i.e., forced to a predetermined logic state) by a corresponding bit, Bp Bq, 
within the key mask value. In the key mask logic 193 of Figure 7, the bits within a given one of 
the Q fields are forced to a logic low state by a corresponding one of AND gates 221 1 -221 q if the 
corresponding key mask bit is low. In alternative embodiments, the Q fields may each include 
more or fewer than four bits and may be a single bit. In any case, the resulting masked key 194 
is output to the hash index generator 131. 

[0021] Extending the embodiment to cover ternary searches presents some challenges. In a 
ternary search, some of the bit fields of a search key (or other portion of the search value) may be 
masked and are considered as don't cares during a search operation. In the following description, 
the unmasked portion of a key is called the invariant portion and the masked portion (don't care 
portion) is called the variant portion. If the hash index were to be calculated on the entire search 
key, (including the variant and invariant portions of the entry) then the hash index when inserting 
an entry can potentially be different than the hash index calculated during a search, even though 
the search value may match the entry (i.e., when selected bits are masked during the compare 
operation). From a search perspective, introducing the key mask logic 193 effectively isolates the 
invariant portions of the key 192 both in the insertion as well as the search operations, forming 
the basis for generation of a unique hash index for each input value. It is not necessary to use the 
entire invariant portion of the key for hash index generation; it can be a sub-set of the invariant 
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portion. As an example, consider Table 1 below which illustrates how the key mask is chosen 
and illustrates the operation of a key mask logic circuit. The entries are ternary entries and the 
variant portions are shown with an 'X'. Each bit of an 8-bit key mask value corresponds to a 
respective 4-bit field within a 32-bit input key (the symbol 'h' indicates hexadecimal notation 
and the symbol V indicates binary notation): 





Key Mask Value: 1 100 001 lb 


Entry# 


Entry value 192 


After Circuit 193 


1 


XX5A5B5Ah 


005A5B00h 


2 


FF5A5AXXh 


005A5AO0h 


3 


FE5A5AXXh 


005A5A00h 



Table 1 



[0022] In the above example, the key mask value masks the first and last bytes (i.e., 8 bits) of 
the input key, thereby establishing the first and last bytes as the variant (don't care) components 
of the entries and the middle two bytes of the entries as the invariant components of the input 
key. Note that if two entries are identical after going through the key mask circuit, they will, as 
in the example above, yield identical hash indices. Conversely, if two input keys have different 
invariant portions, the masked keys will be different and, in general, will yield different hash 
indices. 

[0023] As can be seen, the key mask value may be chosen so that the variant portion of all the 
entries in the search table are in effect set to a known, pre-defined value (in this case binary 0). 
The search process can be illustrated with an example. If the search key of FF5A5A3Ch is 
presented, this key goes through the same key mask circuit and will generate the same index as 
entry#2 in the table, and will also result in a match. 
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[0024] Thus, by programming the key mask value to define a desired invariant component (or 
components) within an input key, a degree of key masking (i.e., in the variant components) may 
be achieved within the hash CAM device. This key masking feature may be used to achieve 
ternary CAM operation within a system that contains multiple hash CAM devices by 
programming a different key mask value into the configuration register of each hash CAM 
device, and storing each input value in a CAM device selected according to the invariant/variant 
key components of the input value. For example, a first hash CAM device may be assigned a 
key mask value that defines bits 24-3 1 of an IPv4 destination address as a don't care field (i.e., 
variant key component) and bits 0-23 of the destination address as a invariant key component, 
while a second hash CAM device may be assigned a key mask value that defines bits 16-23 of 
the IPv4 destination address as variant key component, and bits 0-15 and 24-3 1 as a invariant 
key component. Thereafter data values may be loaded into either the first hash CAM device or 
the second hash CAM device according to whether bits 24-31 or 16-23 may be masked during a 
search operation. This technique for achieving ternary CAM operation is discussed in further 
detail below. 

[0025] Still referring to Figure 7, the hash index generator 131 includes K hash function 
circuits 227i-227k, each of which applies a different hash function to the masked key 194. A 
hash function select value stored in the configuration register is used to select, via multiplexer 
225, one of the hash index functions 227i-227k to output the hash index 136. By this 
arrangement different hash CAM devices within a system may be programmed to use different 
hash functions to reduce the system-wide likelihood of collisions. To understand why different 
hash function selections reduce collision likelihood, consider an example in which two entries 
having identical invariant key patterns are stored in hash CAM devices having different hash 
function selections. During a subsequent NFA operation in which an entry candidate having a 
different invariant key pattern than the entries is input to both hash CAM devices, the entry 



Atty. Docket No. NLMI.P1 12C 



-17- 



candidate may, by happenstance, collide with the entry in one of the hash CAM devices but, due 
to the different hash function selections, is statistically unlikely to collide with the entries in both 
of the hash CAM devices. By contrast, if the same hash function is used within each of the two 
hash CAM devices, happenstance collision between entry candidate stored entry will occur in 
both CAM devices (i.e., the number of collisions is doubled). Thus, the use of different hash 
functions in different hash CAM devices within the same system may reduce the system-wide 
likelihood of collisions. 

[0026] Still referring to Figure 7, in one embodiment, the hash index generator 131 includes 
four hash function circuits 227 1 -2274 each of which generates a respective hash index by 
dividing the masked key by a different cyclic redundancy check (CRC) polynomial. That is, 
each of the four hash function circuits 227 is a CRC circuit that generates a different hash index 
using the same masked key 194. Numerous other hash index generation circuits may be used in 
alternative embodiments, including encryption circuits, checksum circuits, bit-reordering 
circuits, or any other circuit to process the search value or a portion thereof to generate a hash 
index ( herein processing a value refers to receiving the value as an input, and generating a 
different value in response). More generally, while the hash function circuits within the hash 
index generator are preferably designed to generate non-colliding hash indices for a broad range 
of input values, any circuit that returns an index (i.e., address) to the memory within the hash 
CAM block based on all or a selected portion of an input value may be used to implement a hash 
function circuit 227 without departing from the spirit and scope of the present invention. Also, 
the assembler circuit 191, key mask logic 193 and hash index generator 131 may viewed 
collectively to be a hash index generator. 

[0027] Referring again to Figure 6, the memory 133 includes a memory array 201, an address 
decoder 195 coupled to the memory array 201 via a plurality of word lines 204, and a read/write 
circuit 197 coupled to the memory array 201 via a plurality of bit lines 206. As discussed above, 
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compare circuits are not integrated with storage elements within the memory array 201 (i.e., the 
memory array does not include CAM cells) so that a significantly higher storage density may be 
achieved in memory 133 than in the prior art CAM device of Figure 1. In one embodiment, the 
storage elements used to implement the memory array 201 are static random access memory 
(SRAM) storage cells, though dynamic RAM cells or any other form of storage cells may 
alternatively be used. 

[0028] The address decoder 195 receives the hash index 136 from the hash index generator 
131 and decodes the hash index to activate one of the word lines 204. The activated word line 
switchably couples a corresponding row of storage elements within the memory array 201 to the 
bit lines 206, thereby enabling read and write access to the storage elements (i.e., the selected 
row). In a memory read, for example, during a NFA or search operation, the contents of the 
selected row is output via the bit lines 206 to a bank of sense amplifiers within the read/write 
circuit 197. The sense amplifiers sense the contents of the selected row and output the sensed 
data to the compare logic as entry 138. In a memory write (e.g., during an insert operation), a 
bank of write drivers within the read/write circuit 197 drives an entry 208 onto the bit lines for 
storage within the selected row. 

[0029] Figure 8 illustrates address selection logic 235 which, though not shown in Figure 6, 
may be included within the hash CAM device to enable selection of memory addresses from 
different sources. For example addresses may be provided via address bus 230 (e.g., to enable 
testing of the memory array 201). Also, in one embodiment, a register 239 is provided to buffer 
the hash index generated during a NFA operation. If the indexed location is determined not to be 
occupied, register 239 may be selected to provide the address for entry storage during a 
subsequent insertion operation. Also, during a NFA or search operation, the hash index 136 is 
selected to address the memory 133. Additional address sources may be provided in an 
alternative embodiment. In the embodiment of Figure 8, address selection is performed by a 
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multiplexer 235 in response to an address select signal 236 (ASEL). The address select signal 
236 may be provided, for example, by the instruction decoder 203 of Figure 6 or from another 
source. 

[0030] Returning again to Figure 6, the compare logic 135 operates as discussed in reference to 
Figure 3 to generate a match flag 134. That is, during a NFA operation, the compare logic 135 
outputs a match flag that indicates whether the memory location selected by the hash index is 
occupied. During a search operation, the compare logic 135 outputs a match flag 134 that 
indicates whether the search key 210 matches an entry 138 output from a memory location 
selected by the hash index. A NFA signal 212 from the instruction decoder 203 is provided to 
the compare logic 135 to select between different match flag sources according to whether a 
NFA operation or search operation is being performed. 

[0031] Figure 9 illustrates an embodiment of the compare logic 135 of Figure 6. The compare 
logic 135 receives an entry (i.e., entry key 246, mask information 244 and validity value 242) 
from the memory array and the search key 210 as inputs, and includes a multi-mode comparator 
243, mask generator 241, multiplexers 245 and 249, and AND gate 247. The mask generator 
241 decodes the mask information 244 into a decoded mask value 248. Multiplexer 245 selects, 
according to a compare mode value within the configuration register, either the decoded mask 
value 248 or the mask information 244 to be input to the multi-mode comparator 243 as the 
selected mask value. Thus, if the mask information represents an encoded mask value (e.g., as in 
the case of a prefix value included with an IPv4 entry type), the mask generator will decode the 
prefix value to generate a corresponding decoded mask value, and the compare mode value is 
programmed to select the decoded mask value 248 for input to the multi-mode comparator 243. 
Alternatively, if the mask information is a mask value rather than an encoded mask value, the 
compare mode value may be programmed to select the mask information 244 to be input to the 
multi-mode comparator 243. One example of an encoded mask value is a prefix length value 
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that indicates the number of unmasked bits within an IPv4 or IPv6 address field. In the case of 
IPv4, for example, the prefix length value is a 5-bit value that indicates how many bits within a 
32 bit address field of an IPv4 packet are to be unmasked. Thus, a n-bit prefix length value may 
be decoded by mask generator 241 to produce a 2 n bit mask value having a mask (and unmask) 
bit pattern according to the numeric value of the prefix length. For one example, when the prefix 
length value is 16 (010000 binary), the decoded prefix length value (i.e., decoded mask value) is 
0000 0000 0000 0000 1111 1111 1111 1111 binary, where a logic 1 represents a masked bit and 
a logic 0 represents an unmasked bit. In an alternative embodiment, the mask generator 241 may 
be a lookup table containing mask values that are selected (i.e., looked up) by the mask 
information 244; the looked up mask value being output as mask value 248. Alternative mask 
encoding schemes and mask generation schemes may be used in other embodiments. 
[0032] The multi-mode comparator receives the entry key 246 and search key 210 and 
compares the two keys in accordance with the mask value selected by multiplexer 245 and the 
compare mode indicated by the compare mode value. If entry key and search key are determined 
to match one another (i.e., after any masking specified by the compare mode and mask value), 
the multi-mode comparator asserts a match signal 250 to indicate the match detection. If the 
entry key and search key are determined not to match, the match signal is deasserted. The match 
signal 250 is logically ANDed with an active low validity value 242 in AND gate 247 to 
generate a qualified match signal 252. Thus, only a match between the search key and the key 
of a valid entry will result in assertion of the qualified match signal 252. Multiplexer 249 selects 
either the qualified match signal 252 or the validity value 242 to be output as a match/empty 
value 134 according to the state of the NFA signal 212. Thus, when the NFA signal is high (i.e., 
during a NFA operation), the state of the validity value is output by multiplexer 249 to indicate 
whether or not the memory location selected by the hash index is occupied. During a search 
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operation, when the NFA signal is low, the qualified match signal 252 is output to indicate 
whether a match was detected. 

[0033] Still referring to Figure 9, in one embodiment, the validity value is a single bit value 
that is low (e.g., logic 0) to indicate a valid entry and high otherwise. Accordingly, the validity 
value may alternatively be viewed, and is occasionally referred to herein, as an empty signal that 
is high (e.g., logic 1 ) when the entry-sourcing memory location is empty (i.e., does not contain 
valid data) and low when the memory location is occupied (i.e., contains valid data). Validity 
signals having more bits and different logic states may be used in alternative embodiments. 
More generally, alternative embodiments of the compare logic 135 may be used within the hash 
CAM devices of Figures 6 and 3. For example, in one alternative embodiment, the multiplexer 
249 and NFA signal 212 are omitted and the existence and location of a next free address within 
the hash CAM block is determined by circuitry external to the hash CAM device (e.g., a 
programmed processor that tracks memory usage within the hash CAM device). 
[0034] In one embodiment, the multimode comparator 243 is capable of performing different 
types of comparisons according to the programmed compare mode value. Fore example, the 
compare mode value may indicate a binary comparison, ternary comparison (i.e., comparison in 
which mismatch conditions are ignored for selected bit positions), range comparisons (i.e., 
comparison to determine whether search key is greater than or less than a given value) or any 
combination of binary, ternary and/or range comparisons. In an application where less than all 
types of comparisons are needed, the multi-mode comparator 243 may be replaced by a 
comparator that performs only the needed types of comparisons. 

[0035] Figure 10 illustrates an embodiment of the multi-mode comparator 243 of Figure 9 that 
includes N component comparators 251 i-251n (other embodiments may be used). In one 
implementation, each of the component comparators 251 receives a respective mask value 
component (MASKci-MASKoO, entry key component (KEYci-KEYcN)and search key 
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component (SKEYci-SKEYcn), and compares the search key and entry key components in 
accordance with the mask value and in accordance with a respective one of compare mode select 
signals CMSi-CMSn. The compare mode select signals are component signals of the compare 
mode value stored in the configuration register. Each of the component comparators 251 outputs 
a respective component match signal, CMi-CM N , which is ANDed with the component match 
signals output by the others of the component comparators in AND gate 253. AND gate 253 
outputs the match signal 250 to AND gate 247 of Figure 9. 

[0036] Figure 1 1 illustrates the structure of a component comparator 251 of Figure 10 
according to one embodiment. The component comparator includes compare elements 254j- 
254 F , each to compare a sub-field of an entry key component (i.e., sub-fields Ki-K F ) with a 
respective subfield of a search key component (i.e., sub-fields SKi-SK F ). Each of the F sub- 
fields may include one or more bits and each sub-field may include a different number of bits 
than others of the sub-fields. In the embodiment of Figure 1 1, the output of each compare 
element 254 is high if the corresponding entry key and search key sub-fields match. OR gates 
255j-255f receive the outputs of the compare elements 254i-254p, respectively, and also receive 
mask signals from the mask information included within the entry (MASK). The mask 
information may include bit mask values to selectively mask single-bit sub-fields, and the mask 
information may additionally include mask values to mask multi-bit sub-fields. The mask value 
that corresponds to each sub-field is supplied to an input of OR gates 255i-255p and is used to 
selectively mask a mismatch indication from a corresponding compare element. As an example, 
if the sub-field comparison within compare element 254i is to be masked, a logic high mask 
value is input to OR gate 255 1 so that, regardless of whether compare element 254j indicates a 
mismatch or a match, the output of logic OR gate 255 will be high, indicating a match condition. 
If the outputs of all the OR gates 255i-255 F are high, AND gate 258 outputs a logic high 
component match signal 256 (CM) to indicate the match condition. If any of the outputs of 
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compare elements 254i-254p are low (indicating a mismatch condition), and not masked by a 
corresponding mask value, a logic low signal will be input to AND gate 258, thereby producing a 
low component match signal 256 to indicate the mismatch condition. In an embodiment in 
which masking is not required, the OR gates 255 may be omitted from the component 
comparator of Figure 1 1 . Also, different logic states may be used in alternative embodiments. 
Further any circuit may be used to implement the OR gates 255 and/or AND gate 258 including, 
without limitation, wired OR and wired AND logic. 

[0037] Figure 12 illustrates the structure of a component comparator 251 of Figure 10 
according to another embodiment (yet other embodiments may be used). The component 
comparator 251 includes S compare cells, 257i-257s, each to compare a respective bit of an entry 
key component (i.e., bits Ki-Ks) with a corresponding bit of the search key component (bits 
SKi-SKs) in accordance with the state of a corresponding bit of the mask value component (bits 
Mi -Ms). In one embodiment, the compare cells are chained to one another via greater-than 
(GT)/less-than (LT) lines, 260i-260s-i, to enable numeric range comparisons. Each of compare 
cells 257r257 s receives a compare mode select signal (CMS) that indicates a compare mode for 
the corresponding entry and search key components, and each compare cell 257 outputs a match 
signal which is coupled to a component match line 256. In the embodiment of Figure 12, the 
component match line 256 is pulled up to a first reference level (e.g., a supply voltage level) by 
pull-up circuit 259, and pulled-down to a second reference level (e.g., a ground voltage level) by 
a mismatch or out-of-range detection within compare cells 257j-257 s . Different voltage levels 
and/or logic states may be used to signal match/mismatch in alternative embodiments. 
[0038] Figure 13 illustrates a compare cell 257 according to one embodiment. The compare 
cell 257 receives a search key bit (SK) and complement search key bit (/SK), an entry key bit (K) 
and complement entry key bit (/K), and a complement mask bit (/M) as inputs. The compare cell 
257 includes a range compare cell 275 to perform a single bit range comparison, and a match 
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compare cell 285 to perform a maskable, bitwise match comparison. The compare mode select 
signal, CMS, is used to alternately enable the range compare cell 275 and disable the match 
compare cell 275, or enable the match compare cell 285 and disable the range compare cell 275. 
More specifically, when the compare mode select signal is high, transistor 282 is switched on to 
enable operation of the match compare cell 285, and transistor 267 is switched off (i.e., due to 
the operation of inverter 270) to disable operation of the range compare cell 275. Conversely, 
when the component compare mode select signal is low, transistor 282 is switched off to disable 
the match compare cell 285 and transistor 267 is switched on to enable the range compare cell 
275. 

[0039] When the match compare cell 285 is enabled, transistors 276, 278, 277, 279, and 281 
cooperate to perform a maskable comparison of the search key bit and entry key bit. Transistor 
281 is coupled to receive the complement mask bit, /M, so that when the complement mask bit is 
low (i.e., mask bit high to mask the compare operation), transistor 281 is switched off, and 
transistors 276, 278, 277 and 279 are prevented from pulling the match line low, thereby 
masking any mismatch indication. By contrast, when the active low mask bit is high (i.e., mask 
bit low to unmask the compare operation), transistor 281 is turned on to enable transistors 276, 
278, 277 and 279 to pull the match line 256 low in the event of a mismatch between the search 
key bit and entry key bit. For example, if the entry key bit is high and the search key bit is low, 
transistors 279 and 277 will be switched on, pulling the match line low through transistors 281 
and 282. Alternatively, if the search key bit is high and the entry key bit is low, transistors 276 
and 278 will be switched on, pulling the match line through transistors 281 and 282. 
[0040] The range compare cell 275, when enabled, signals an out-of-range condition by 
asserting a logic low signal (GT;) on GT line 260i if either (1) the search key bit is greater than 
the entry key bit, or (2) the search key bit and entry key bit are equal and a greater-than signal 
(GTi_i) from a less significant compare cell is low. More specifically, if the search key bit is 
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high and the entry key bit is low (i.e., if the search key bit is greater than the entry key bit), then 
transistors 262 and 264 are switched on to form, along with enabling transistor 267, a path 
between the GT line 260i and ground, thereby pulling the GT line 260i low. Transistors 266 and 
268 are used to couple the GTVi signal on input GT line 260m to the GT line 260i if the search 
key bit and entry key bit are equal. That is, if the search key bit is high (meaning that the search 
key bit must either be equal to or greater than the search key bit), transistor 266 is switched on to 
enable the GTm signal onto the GT line 260i, and if the entry key bit is low (again meaning that 
the search key bit is equal to or greater than the entry key bit), then transistor 268 is switched on 
to enable the GTj.j signal onto the GT line 260;. Thus, if either transistor 266 or 268 is switched 
on, a low GTi_i signal (indicating that a less significant portion of the search key component is 
greater than a corresponding less significant portion of the entry key component), will be 
propagated to the GT line 260i of range compare cell 275. Referring to Figure 1 1, the 
propagation of a low GT signal continues through the chain of compare cells (from right to left 
in the exemplary diagram of Figure 11) until the signal is blocked by a compare cell that receives 
an entry key bit that is greater than the search key bit (i.e., transistors 266 and 268 are both 
switched off), or until the signal is output from the most significant compare cell in the chain 
(i.e., compare cell 257s in this example). The GT line 260s driven by the most significant 
compare cell 257s is coupled to the match line 256, thereby pulling the match line 256 low in the 
event of an out-of-range condition (i.e., search key component greater than entry key 
component). 

[0041] Note that the compare cells 257 1 -257s may alternatively pull the match line low in 
response to detecting that the search key component is less than the entry key component (i.e., 
signal a less-than (LT) condition). Referring again to Figure 13, for example, instead of 
comparing the search key bit against the complement of the stored key in transistors 262 and 
264, the complement search key bit may be compared against the uncomplemented entry key bit, 
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pulling line 260 (now a LT line) low if the entry key bit is high and the search key bit is low. 
Similarly transistors 266 may pass a LTVi signal from a less significant compare cell to the LT 
line 260 if the complement search key bit is high (meaning that the search key bit is low) or the 
entry key bit is high. 

[0042] In another embodiment, both out-of-range conditions, GT and LT, may be signaled by a 
single compare cell by using the mask bit to indicate the lower (or upper) boundary of the range 
definition. In yet another embodiment, a selection may be made between GT and LT range 
comparisons by selectively multiplexing the key bits applied to transistors 262, 264, 266 and 
268. Also, the mask transistor 281 may be omitted from the match compare cell 285 in an 
alternative embodiment in which maskable compare (i.e., ternary compare) operation is not 
needed. 

[0043] Referring again to Figure 10 each of the component comparators 251 may include any 
number of compare cells and may include compare cells that are dedicated or configurable to 
perform range comparisons, ternary comparisons and/or binary comparisons. For example, in 
alternative embodiments, one or more of the compare cells within component comparators 251 
may include only range compare cells 275 and not match compare cells 285, or only match 
compare cell 285 and not range compare cells 275. 

[0044] Reflecting on the operation of the hash CAM devices of Figures 3 and 6 it should be 
noted that in an alternative embodiment, the information maintained in the configuration register 
may be stored on an entry-by-entry basis within memory 133, effectively enabling multiple 
search tables to be stored within a single device In such an embodiment, a hash index may be 
generated based on a predetermined portion (or all) of a search value. The configuration 
information within the entry retrieved from the indexed location may then be used to qualify the 
comparison of the search value and the entry. For example, an entry type within the entry may 
be compared with a class code associated with the search value to qualify any match detection 
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(i.e., suppress assertion of the match flag 136 if the entry type and class code do not match), 
mask information within the entry may be used to mask mismatches at selected bit positions of 
the entry and search key, control information within the entry may be used to select the type of 
comparison operation to be performed (e.g., binary, ternary and/or range compare) and so forth. 
In general, any information that may be stored in the configuration register may be stored on an 
entry by entry basis within the memory 133. 

Overflow CAM 

[0045J Reflecting on the operation of the hash CAM devices described in reference to Figures 
3 and 6, it should be noted that a host device generally will not know beforehand whether a given 
data value may be stored within the hash CAM device. That is, except when the hash CAM 
device is completely empty, a preliminary search (i.e., NFA operation) is used to determine 
whether a collision with an existing hash CAM entry will occur. If a collision is detected, the 
data value may not be stored in the hash CAM device and the hash CAM device is said to be 
conflicted as to that data value. In a system that includes multiple hash CAM devices, a data 
value that cannot be stored in a conflicted hash CAM device may instead be stored in another 
hash CAM device. To maintain determinism in the data load operation, all the hash CAM 
devices in the system may be concurrently instructed to perform a NFA operation, followed by 
an insert instruction to an unconflicted one of the hash CAM devices. In the event that all the 
hash CAM devices in the system are conflicted, a data value may be stored in a backup or 
overflow CAM device that includes an array of CAM cells. Taking this concept a step further, 
the overflow CAM device may be implemented in the same integrated circuit as the hash CAM 
device (or multiple hash CAM devices), and control over the CAM devices combined within a 
unified instruction decoder or other control unit. This approach has the advantage of relieving 
system level resources (i.e., host processor, instruction buses etc.) from managing load and 
search operations. In one embodiment, referred to herein as a unified load embodiment, the 
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presence of the hash CAM is hidden from the host altogether by performing both a NFA 
operation and an insert operation within an integrated hash CAM/overflow CAM device in 
response to a single load instruction. That is, from the host perspective, the integrated hash 
CAM/overflow CAM device behaves like a conventional CAM device by deterministically 
storing a data value (i.e., storing the data value within a known time) in response to a load 
instruction. In an alternative embodiment, referred to herein as a two-phase load embodiment, a 
load operation is achieved by issuing a NFA instruction to the integrated hash CAM/overflow 
CAM device, followed by an insert instruction. 

[0046] Figure 14 illustrates a CAM device 300 that includes both hash CAM and overflow 
CAM functions in a hash CAM block 301 and overflow CAM block 303, respectively. The hash 
CAM block 301 may include, for example, the circuit blocks of hash CAM device 130 of Figure 
3 or hash CAM device 190 of Figure 6. A shared instruction decoder 309 issues control and 
timing signals to both CAM blocks in response to host instructions received via instruction bus 
200 and a clock signal received via clock line 302. A data bus 140 is input to both the hash 
CAM block and the overflow CAM block, and each of the CAM blocks 301 and 303 outputs a 
respective block flag (BF H c, BFofc), block priority value (BP H c> BP 0 fc) and block index 
(BINhc, BINofc). 

[0047] The CAM device 300 also includes a device flag circuit 305 and device priority logic 
307. The device flag circuit 305 receives the block flags from the hash CAM and overflow 
CAM blocks and logically combines the block flags to generate a device flag 304 (DF). In one 
embodiment, the block flags are combined in an OR operation so that, if either CAM block 
indicates a match condition, the device flag circuit 305 will assert the device flag 304. The 
device flag 304 may also be asserted to indicate a full/empty status for the CAM device 300 
(e.g., during a non-search operation), or a separate signal or signals may be output for such 
purposes. Further, additional flags may be output by the flag logic such as a multiple match flag 
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(to be asserted, for example, when more than CAM block indicates a match condition), an almost 
full flag (to be asserted, for example, when a CAM block reaches a predetermined fill-level), and 
so forth. The device priority logic 307 receives the block flags, block priority values and block 
indices from the hash CAM and overflow CAM blocks, and outputs, as a device priority value 
306 (DP), a highest priority one of the block priority values for which the corresponding block 
flag is asserted. The device priority logic 307 also outputs a device index 308 (DIN) that 
includes a block identifier (block ID) component and a block index component. The block 
identifier component is a block address that corresponds to the CAM block (301 or 303) that 
sourced the device priority value, and the block index component is the block index (BIN H c or 
BINofc) output by the CAM block indicated by the block ID. 

[0048] In response to a load instruction (or NFA instruction in the case of a two-phase load 
embodiment), the instruction decoder 309 initiates a NFA operation within the hash CAM block 
301 to determine whether the hash CAM block 301 is conflicted. In response, the hash CAM 
block outputs 301 a block index (BIN H c) and a block flag (BF H c) that indicates whether the 
indicated memory location is empty or occupied. In one embodiment, the instruction decoder 
309 does not enable the overflow CAM block 303 to participate in the NFA operation so that the 
overflow CAM block 303 does not assert a block flag. Consequently, during the NFA operation, 
the device flag circuit 305 outputs a device flag 304 that corresponds to the state of the hash 
CAM block flag (BF H c) and the device priority logic 307 outputs a device index 308 that 
includes a block ID that corresponds to the hash CAM block 301 and the hash CAM block index 
(BINhc). Thus, after a NFA operation within the CAM device 300, the device flag 304 will 
indicate whether a candidate entry may be stored in the hash CAM block 301 and, if so, the 
device index 308 will indicate the location within the hash CAM block 301 at which the entry 
may be inserted (i.e., the insertion address). In a two-phase load embodiment, the host may 
respond to an asserted device flag signal by issuing an insert instruction to insert the candidate 
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entry at the indicated location within the hash CAM block 301 . If the device flag is not asserted, 
the host may issue an instruction to insert the entry candidate into the next free address within 
the overflow CAM block 303. In one embodiment, during an insertion into the overflow CAM 
block 303, the overflow CAM block 303 outputs a next free address (i.e., available location 
within the overflow CAM block) and a block flag (BF 0 fc) that indicates whether the overflow 
CAM is full. The overflow CAM block flag is output by device flag circuit 305 as the device 
flag 304 (the block flag from hash CAM block 301 being deasserted) so that a host device may 
determine when the overflow CAM block 303 becomes full and cease further instructions to 
insert collision-producing entry candidates into the overflow CAM block 303. 
[0049] In an alternative embodiment of the hash CAM device 300, the overflow CAM block 
303 participates in a NFA operation by asserting its block flag signal, BFofc, if the overflow 
CAM block 303 is not full and by outputting the next free address within the overflow CAM as 
block index, BINofc* The overflow CAM block 303 further outputs a predetermined block 
priority value, BPofc, that is lower than the priority value output by an unconflicted hash CAM 
block 301 . By this arrangement, so long as the overflow CAM block 303 is not full, the CAM 
device 300 will always assert the device flag signal in response to a NFA instruction, and the 
device priority logic will select, according to which CAM block (301 or 303) outputs the highest 
priority block priority value in combination with an asserted block flag, one of the block indices, 
BINhc or BINofc, to form the block index component of the device index 308, and will generate 
a block ID that corresponds to the selected block index. Thus, if the device flag 304 is asserted 
in response to a NFA instruction, the host device may instruct the CAM device 300 to store the 
entry candidate at the block and storage location indicated by the device index 308. Note that, in 
a unified load embodiment, the instruction decoder 309 (or other control circuit within the CAM 
device 300) may respond to assertion of the device flag 304 and device index 308 by signaling 
the indicated CAM block (301 or 303) to store the candidate entry at the indicated block index. 
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In either embodiment (i.e., unified or two-phase load), a collision within the hash CAM block 
301 will not prevent the CAM device 300 from being able to store an incoming data value. Also, 
from the perspective of a host device, the unified load embodiment of CAM device 300 responds 
to a load request by carrying out the instructed load operation. The host device (and therefore 
the system designer) need not know that the CAM device 300 searches for an insertion address, 
and system level operation is not impacted by which of the hash CAM block 301 and the 
overflow CAM block 303 is ultimately selected for entry insertion. 

Using Multiple Hash CAM Blocks to Achieve Ternary CAM Operation 
[0050] As discussed above in reference to Figs 6 and 7, the portion of the key used to generate 
the storage index of an entry and the search index for a matching search key may not be masked 
without destroying the equality between the two indices. This aspect of hashing operation 
presents a significant obstacle to ternary CAM operation in which the portion of the search key 
to be masked varies from one search value to the next. 

[0051] As discussed above in reference to Figure 7, the key mask value within a configuration 
register may be used to define a desired invariant component (or components) within an input 
key, thereby defining the remainder of the key to be a maskable, variant portion of the key. 
Consequently, ternary CAM operation in which selected bits of a search value are ignored for 
match detection purposes, may be achieved at least within the variant portion of the key. In one 
embodiment of the present invention, this limited ternary CAM operation is extended by 
providing multiple hash CAM blocks within a CAM device, and by selectively storing entries 
within the hash CAM blocks according to the locations of the variant portions of their key 
components (i.e., according to their variant key definitions). For example, all entries having a 
first variant key definition may be stored in a first hash CAM block (or set of hash CAM blocks), 
all entries having a second variant key definition may be stored in a second hash CAM block (or 
set of hash CAM blocks), and so forth. By this arrangement, all or a significant percentage of 
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anticipated variant key definitions may be allocated among the multiple hash CAM blocks (or 
sets of hash CAM blocks) to achieve a relatively robust ternary CAM operation. Entry 
candidates having an unassigned variant key definition may be stored in an overflow CAM 
block. 

[0052] In one embodiment, a variant key definition is assigned to a given hash CAM block by 
programming a corresponding key mask value into the configuration register of the CAM block. 
Thereafter, as part of a NFA operation, the key mask value within each CAM block is compared 
with a mask code that indicates a variant key definition for the entry candidate (i.e., the mask 
code indicates which bits within the key of the entry candidate may be masked). Using this 
approach, the key mask value programmed within each CAM block effectively allocates the hash 
CAM block to a logical group of hash CAM blocks (or at least one hash CAM block) referred to 
herein as a mask pool. 

[0053] The entry type value programmed within the configuration register of each hash CAM 
block also allocates the hash CAM block to a logical group of one or more hash CAM blocks 
referred to herein as an entry type pool. In one embodiment, each hash CAM block is allocated 
to one of a finite number of entry type pools which are used to store only entries having the 
programmed entry type. For example, a first set of hash CAM blocks may be allocated (i.e., by 
appropriate assignment of entry type values within their respective configuration registers) to an 
IPv4 pool, a second set of hash CAM blocks may be allocated to an MPLS pool, a third set of 
hash CAM blocks may be allocated to a packet classification pool, a fourth set of hash CAM 
blocks may be allocated to an IPv6 pool, and so forth. Any hash CAM block, in any physical 
order, can be assigned to any entry type pool. 

[0054] During a NFA operation, the programmed entry type and key mask value of each hash 
CAM block are compared with the entry type and key mask value of the entry candidate. Only 
those hash CAM blocks which fall within the mask pool and entry type pool indicated by the 
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entry candidate are enabled to participate in the NFA operation, with all other CAM blocks being 
disabled from asserting their block flags (i.e., disabled from indicating an ability to store the 
candidate entry). Thus, all hash CAM blocks configured with the same key mask value and 
entry type define a logical pool, called an insertion pool, into which an entry having a matching 
key mask value and entry type may be asserted. 

[0055] During a search operation, the entry type for an incoming search value is compared 
with the entry type of each hash CAM block, and only those hash CAM blocks which have been 
programmed with a matching entry type are enabled to participate in the search. That is, all hash 
CAM blocks which do not fall within the indicated entry type pool are disabled from signaling a 
match. 

CAM Device with Multiple Hash CAM Blocks 
[0056] Figure 1 5 illustrates an embodiment of a CAM device 3 1 0 that includes multiple hash 
CAM blocks 319i-319 N along with an overflow CAM block 321. Each of the hash CAM blocks 
319 outputs a respective block flag (BFi-BF N ), block priority value (BPi-BP N ), and block index 
(BINi-BIN N ), as does the overflow CAM block 321 (i.e., BF 0 fc, BPofc and BIN 0 fc). Device 
flag circuit 323 receives the block flag signals (BF) from each of the hash CAM blocks 319 and 
the overflow CAM block 321, and a device priority logic circuit 325 receives the block priority 
value, block index, and block flag signal from each of the hash CAM blocks 3 1 9 and the 
overflow CAM block 321 . The hash CAM device 310 operates similarly to the hash CAM 
device 300 of Figure 14 to perform NFA, insert and search operations in response to 
corresponding signals from an instruction decoder 302. During a search operation, each hash 
CAM block 319j-319n and the overflow CAM block 321 is searched concurrently to generate a 
set of block flags (BFi-BF N , BF 0 fc) 3 block priority values (BPi-BP N , BP 0 fc) and block indices 
(BINi-BINn, BINofc)- The device flag circuit 323 asserts the device flag 304 if any of the block 
flag signals are asserted. The device priority logic 307 generates a device index and device 
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priority value according to the highest priority block priority value for which the corresponding 
block flag is asserted. During a NFA operation, each hash CAM block generates a block flag to 
indicate whether the CAM block is conflicted, and also a block priority value that may be used to 
select between multiple unconflicted CAM blocks according to a predetermined insertion policy. 
[0057] Figure 1 6 illustrates the structure of a hash CAM block 3 1 9 that may be used within the 
hash CAM device 310 of Figure 15. The hash CAM block 319 includes an assembler circuit 
191, key mask logic 193, hash index generator 131, address select logic 235, memory 133 and 
configuration register 137, all of which operate generally as described in reference to Figures 6- 
8, as well as output logic 335. Referring briefly to Figure 15, it should be noted that the.size of 
the memory 133 within a given one of the hash CAM blocks 31 9i -31 9 N may be different from 
others of the hash CAM blocks. For example, in one embodiment, some of the hash CAM 
blocks contain memories with 256 storage rows, while others of the hash CAM blocks contain 
memories with 512 storage rows. Other sized memories may be used in alternative 
embodiments. 

[0058] The hash CAM block 3 1 9 further includes block select logic 33 1 to generate a block 
select signal 332 according to an incoming search ID value 330 (SID). In one embodiment, the 
search ID 330 is incorporated or encoded within the operation code or operand of an incoming 
NFA instruction or search instruction. In a NFA instruction, the search ID includes a mask code 
and class code that correspond to a key mask value and entry type value, respectively, and 
therefore specify an insertion pool. The block select logic compares the mask code and class 
code against the key mask and entry type values, respectively, programmed into the 
configuration register 137. If the specified codes match the programmed values, the block select 
logic 33 1 asserts the block select signal 332 to enable the output logic 335 to assert the block flag 
338 (i.e., if no collision is detected). If the codes do not match the programmed values (i.e., the 
specified insertion pool does not match the programmed insertion pool), the block select logic 
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331 deasserts the block select signal 332 to disable the output logic 335 from asserting the block 
flag 338, thereby preventing the hash CAM block from being selected for entry insertion. 
[0059] In a search instruction, the search ID includes a class code, but no mask code. 
Accordingly, the block select logic 331 compares only the programmed entry type value with the 
class code. If the class code and programmed entry type match (i.e., programmed entry type 
pool matches the search-specified entry type pool), the block select signal 332 is asserted to 
enable the output logic 335 to assert the block flag 338 (i.e., if a match is detected between 
search value and entry). If the programmed and specified entry type pools do not match, the 
block select signal 332 is deasserted to disable the output logic 335 from asserting the block flag 
338. Note that, in alternative embodiments, the block select signal 332 may be applied to any 
other circuit blocks within the hash CAM block to prevent operation of those circuits (e.g., to 
save power) in the event of a mismatch between the search ID and programmed value(s). 
[0060] Figure 17 illustrates an embodiment of the block select logic 331 of Figure 16 (other 
embodiments may be used). The block select logic 33 1 includes a pair of comparators 343 and 
345, AND gate 347 and a multiplexer 349. As shown, an input search ID 330 includes a class 
code (CC) that specifies an entry type pool and, in a NFA operation, a mask code (MC) that 
specifies a mask pool. During a NFA operation, the mask code is compared to the key mask 
value in comparator 345 and the class code is compared with the entry type value in comparator 
343, the key mask and entry type values being stored within the configuration register of the 
hash CAM block. If the entry type matches the class code and the key mask matches the mask 
code, both inputs to AND gate 347 will be high resulting in a logic high output to a first input 
port of the multiplexer 349. The NFA signal 212 is high during the NFA operation, causing the 
multiplexer 349 to select the output of AND gate 347 to drive the block select signal 332. Thus, 
during a NFA operation, the block select signal 332 will have a high or low state according to 
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whether the insertion pool specified by the search ID matches the insertion pool defined by the 
programmed key mask and entry type values. 

[0061] During a search operation, the NFA signal is low so that the multiplexer 349 selects the 
output of comparator 343 to set the state of the block select signal 332. Consequently, during a 
search operation, the block select signal 332 will have a high or low state according to whether 
the entry type pool specified by the search ID matches the entry type pool indicated by the 
programmed entry type value. 

[0062] Figure 18 illustrates an embodiment of the output logic 335 of Figure 16 (other 
embodiments may be used). The output logic 335 includes the compare logic 135 discussed 
above in reference to Figure 6 as well as AND gate 355 to disable assertion of the block flag 338 
if the block select signal 332 is not asserted (i.e., to a logic high state in this example). 
[0063] The block priority value 336 output during a NFA operation is referred to herein as an 
insert priority value, and is compared with insert priority values output by other hash CAM 
blocks to select one of a number of available hash CAM blocks to store the input value. During 
a search operation, by contrast, the priority component (PRI) of the indexed entry 138 (referred 
to as a search priority value) is output as the block priority value 336. The output logic 335 
includes a multiplexer 359 to select, either an insert priority value or the search priority value to 
be output as block priority value 336 according to the state of the NFA signal 212. That is, if the 
NFA signal is low, indicating a search operation, the priority component of entry 138 is selected 
to be output as the block priority value 336. If the NFA signal is high, indicating a NFA 
operation, an insert priority value is selected. In one embodiment, the insert priority value is a 
predetermined priority value (zero in this example). Alternatively, in an embodiment that 
implements a capacity-dependent insertion policy, the insert priority value is maintained in a li- 
bit fill counter 357 that is incremented in response to an insertion operation within the hash CAM 
block (in which case signal WR is asserted, for example, by the instruction decoder) and 
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decremented in response to a delete operation within the hash CAM block (in which case signal 
DEL is asserted). Thus, the insertion priority value indicates the fill level of the hash CAM 
block, and, when output as block priority value 336, enables the device priority logic 325 of 
Figure 15 to select between multiple unconflicted hash CAM blocks (i.e., for entry insertion) on 
the basis of which is most or least filled. 

[0064] Figure 19 illustrates an embodiment of the device priority logic 325 of Figure 15 (other 
embodiments may be used). (Note that the device priority logic 325 may also be used in the hash 
CAM block of Figure 14; a case in which there is only one hash CAM block). The device 
priority logic 325 includes a priority compare circuit 361, priority encoder 363 and index selector 
365. As discussed in reference to Figure 15, the device priority logic 325 receives the block 
flags, block priority values and block indices from each of N hash CAM blocks (i.e., BFi-BF N , 
BPi-BP N , and BINi-BIN N ) and also the block flag, block priority value and block index from the 
overflow CAM device (BFofc3Pofc3INofc), and outputs a highest priority one of the block 
priority values, for which the corresponding block flag signal is asserted, as a device priority 
number (DP). Herein, a block priority value for which the corresponding block flag signal is 
asserted is referred to as a match-qualified block priority value, and, conversely, a block priority 
value for which the corresponding block flag signal is deasserted is referred to as a disqualified 
block priority value. During a given NFA or search operation, disqualified block priority values 
are disabled from participation in a priority number comparison operation (e.g., by being forced 
to a lowest possible priority value) such that only a match-qualified block priority value may be 
output as the device priority number. In one embodiment, if all the block priority values are 
disqualified (i.e., no match signals are asserted), a lowest possible priority number is output as 
the device priority number 306. 

[0065] The priority compare circuit also outputs a plurality of qualified match signals 366j- 
366 N +i, each of which corresponds to a respective one of the N+l CAM blocks (i.e., match 
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signals 366i-366n correspond to the N hash CAM blocks 319 and match signal 366n+i 
corresponds to the overflow CAM block 321). Each qualified match signal, if asserted, indicates 
that the corresponding CAM block sourced (i.e., output to the priority compare circuit) the 
device priority number. The priority encoder 363 encodes the qualified match signals into a 
block identifier (BID) that identifies the CAM block that sourced the device priority number. 
Note that multiple CAM blocks may source the device priority number (a multiple match 
condition), in which case the priority encoder generates a block identifier according to a 
predetermined tie resolution policy. For example, in one embodiment, the priority encoder 
generates a block identifier that identifies the lowest numbered one of the CAM blocks for which 
qualified match signals are asserted. In one embodiment, for example, the lowest numbered one 
of the CAM blocks is the CAM block having the lowest physical address for the memory address 
space spanned by the hash CAM blocks. Different priority encoding policies may be used in 
alternative embodiments. 

[0066] Still referring to Figure 19, the block identifier is supplied to the index selector where it 
is used to select the block index output by the identified CAM block as a selected block index 
(BINsel). In one embodiment, the index selector is a multiplexer circuit that selects the block 
index to be output as BINsel in response to the block identifier. Together, the block identifier 
and the selected block index constitute the device index, DIN. 

[0067] Figure 20 is a flow diagram of a NFA operation within the hash CAM device of Figure 
15. In response to a NFA signal from the instruction decoder, NFA operations are performed 
concurrently within each of the N hash CAM blocks at 375i-375n to generate a corresponding set 
of N block priority values and N block flags. At 377, a block identifier (BID) is generated by 
priority encoding the match-qualified block priority values. Also at 377, the block index that 
corresponds to the block ID is chosen to be the selected block index, BIN S el. At 379, a device 
flag (DF) is generated by logically ORing the block flag signals received from the hash CAM 
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blocks, and the device index (DIN), formed by the block identifier and selected block index, is 
output. 

[0068] Still referring to Figure 20, it should be noted that the block flag signal and block 
priority value from the overflow CAM device may also included in the block identifier, block 
index selection and device flag generation operations. In one embodiment, block flag signal 
from the overflow CAM, BFofc, is suppressed during a NFA operation (i.e., disabled from being 
asserted), so that the overflow CAM device is not identified by the block ID and does not 
contribute the device flag generation. In an alternative embodiment, discussed below, the 
overflow CAM device is enabled to participate in a NFA operation. 

[0069] Figure 21 is a flow diagram of a NFA operation within an i th one of the hash CAM 
blocks of Figure 15, i being an integer value between 1 and N, inclusive. At 381, the class code 
of the NFA instruction (CC) is compared with the entry type of the CAM block (ETi), and the 
mask code of instruction (MC) is compared with the key mask of the CAM block (KMi) . If the 
class code matches the block entry type and the mask code matches the block key mask, then at 
385, a key is generated from a selected portion of the incoming value. A hash index is generated 
based on the key at 387, and at 389 an entry is read from the hash block memory at the location 
indicated by the hash index. Finally, at 391, the hash index is output as the block index; the 
validity value of the entry read from memory is output as the block flag; and a predetermined 
value (e.g., 0) or, in an alternative embodiment, a value indicative of the block fill level is output 
as the block priority value. If, at 381, the class code does not match the block entry type, or the 
mask code does not match the block key mask, the block flag is deasserted at 383. 
[0070] Figure 22 illustrates an insert operation within the hash CAM device of Figure 15. The 
device flag is evaluated at 395. If the device flag is not set, then no hash CAM block asserted its 
block flag during a preceding NFA operation, and the data value is stored in the overflow CAM 
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(397). If the device flag is set, then at least one hash CAM block is available to store the data 
value, and the data value is stored at the location indicated by the device index (399). 
[0071] Figure 23 illustrates a search operation within the hash CAM device of Figure 1 5. At 
405i-405n+i, each of the N hash CAM blocks and the overflow CAM block are searched in 
parallel to generate a set of block flags, block priority values, and block indices. At 407, a block 
identifier (BID) and device priority (DP) are generated by priority encoding the block flags 
according to the block priority values. At 409, one of the N+l block indices (i.e., BINi, BIN2, 
. . ., BIN N or BINofc) is selected according to the block identifier to be BINsel, the selected block 
index. At 409, the block identifier and selected block index are output to form the device index 
(DIN), and the device flag (DF) is generated by logically ORing the block flags from the hash 
CAM blocks and overflow CAM block. 

[0072] Figure 24 illustrates a search operation within a hash CAM block of Figure 15. At 41 5, 
the class code of the incoming search instruction (CC) is compared with the entry type (ETi) of 
the hash CAM block. If the class code does not match the entry type, the block flag (BFi) is 
deasserted at 417 and the search operation is concluded. If the class code does match the entry 
type, then a key is generated at 419, and used to generate a hash index at 421 . At 423, an entry is 
read from the hash CAM block memory at a location indicated by the hash index. At 425, the 
block index is assigned to be the hash index and output from hash CAM block. Also at 425, the 
block flag (BFi) is asserted or deasserted according to whether the search key matches the entry 
key (i.e., after accounting for any masking indicated by the mask information within the entry), 
and the priority field of the entry is output as the block priority value (BPi). 
[0073] Figure 25 illustrates a NFA operation within an exemplary CAM device 429 having 
four hash CAM blocks, 319i-3194, each of which outputs a fixed insert priority value. For 
simplicity, each hash CAM block is depicted as having six storage locations, with hash CAM 
blocks 1 and 2 (319 1, 3192) programmed to select the hash index generated by a first type of hash 
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function circuit 431 and hash CAM blocks 3 and 4 (3193, 3194) programmed to select a hash 
index generated by a second type of hash function circuit 433. Thus, for a given masked key 
value (MKEY), the hash indices generated within hash CAM blocks 1 and 2 will be the same 
(i.e., index 340), and the hash indices generated within CAM blocks 3 and 4 will be the same 
(i.e., index 341), but, except in cases of coincidence, index 340 will be different from index 341. 
Also for simplicity, the key mask values and entry type values programmed within the four hash 
CAM blocks are assumed to be the same (i.e., all four hash CAM blocks are in the same 
insertion pool). 

[0074] Because all four hash CAM blocks 319i-319 4 are initially empty, each hash CAM block 
will respond during a NFA operation by asserting its block flag and outputting the same fixed 
block priority value. Thus, the device priority logic (i.e., element 325 of Figures 15 and 19) will 
select one of the hash CAM blocks according to a predetermined, hardwired tie resolution policy. 
In this particular example, lower numbered hash CAM blocks (e.g., those with lower physical 
addresses) are given higher priority over higher numbered hash CAM blocks so that, even 
though all four hash CAM blocks output the same priority number, the device priority logic will 
generate a block identifier (BID) that corresponds to hash CAM block 1 (319i). (In alternative 
embodiments, different tie resolution policies may be used, and a Value within the block 
configuration register may be used to select between multiple tie resolution policies). Thus, in 
the example of Figure 24, hash index 340 and a block identifier for hash CAM block 1 form the 
device index output by the CAM device 429. Accordingly, during an ensuing insert operation, 
an entry is stored in hash CAM block 1 at the location pointed to by hash index 340. This 
location within the hash CAM device (i.e., location 430) is referred to herein as an insertion 
address, and is indicated in Figure 24 by a lined shading pattern . 

[0075] Figure 26 illustrates an exemplary NFA operation within the hash CAM device 429 of 
Figure 25 after a number of entries (indicated by cross-hatched shading) have been loaded into 



Atty. Docket No. NLMI.P1 12C 



-42- 



the hash CAM blocks 319i-3194. During the NFA operation, hash CAM block 1 deasserts its 
block flag to signal its inability to store the data value that produced the masked key because an 
entry has previously been inserted at the indexed location. That is, hash CAM block 1 is 
conflicted due to a collision between index 340 and the index generated by a previously loaded 
entry. Hash CAM blocks 2, 3 and 4, on the other hand, are unconflicted and each asserts its 
block flag and outputs the fixed insert priority value. Thus, each of the hash CAM blocks 2, 3 
and 4 provides the same match-qualified priority value to the device priority logic which, in this 
example, is hardwired to resolve ties in favor of the lowest numbered hash CAM block. 
Accordingly, as indicated by insertion address 430, hash CAM block 2 is selected to store the 
new entry in this example. 

[0076] Figures 27 and 28 illustrate NFA operations within a hash CAM device 439 that is 
identical to the exemplary hash CAM device 429 of Figures 25 and 26 except that the hash CAM 
device 439 implements a capacity-dependent insertion policy. In the examples of Figures 27 and 
28, each hash CAM block responds to a NFA instruction by outputting a priority value that is 
proportional to its fill level. Referring specifically to Figure 27, hash CAM blocks 2, 3 and 4, 
being completely unfilled, output numerically lower insert priority values than hash CAM block 

1, which contains a previously loaded entry. In an embodiment in which numerically lower 

v. 

values indicate a higher priority than numerically higher values (the case in the example of 
Figure 27), hash CAM blocks 2, 3, and 4 will each output a match-qualified priority number 
having a higher priority than the match qualified priority number output by hash CAM block 1, 
with hash CAM block 2 ultimately being selected by the device priority logic (i.e., by hardwired 
tie resolution logic) for entry insertion. Note that, had fixed insert priority values been used, 
hash CAM block 1 would have sourced the same match qualified priority number as hash CAM 
blocks 2, 3 and 4 and would have been selected for entry insertion by the hardwired tie resolution 
logic. 
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[0077] Figure 28 illustrates an exemplary NFA operation within the hash CAM device 439 of 
Figure 27 after a number of entries have been loaded into the hash CAM blocks 319i-3194. In 
this example, hash CAM block 1 (319i) is conflicted, but hash CAM blocks 2, 3 and 4 are not. 
Hash CAM blocks two and three each contain four previously loaded entries, however, while 
hash CAM block 4 contains only three previously loaded entries. Consequently, hash CAM 
block 4 (3 194) will output a lower priority number than hash CAM blocks 2 and 3, and therefore 
be selected for entry insertion. Note that, had fixed insert priority values been used, hash CAM 
blocks 2, 3 and 4 would each have output the same match-qualified priority number, thereby 
causing CAM block 2 to be selected for entry insertion according to the hardwired tie resolution 
policy. 

[0078] Although Figures 27 and 28 illustrate NFA examples using a least-filled insertion 
policy, a most-filled insertion policy (i.e., policy which favors a more-filled rather than a less 
filled hash CAM block for entry insertion purposes) may be applied instead. A most- filled 
insertion policy may be implemented, for example, by using an empty counter rather than the fill 
counter 357 within the output logic 335 of Figure 18. The empty counter may be preloaded to a 
predetermined value, then decremented in response to each insertion event (i.e., insertion of an 
entry within the corresponding hash CAM block) and incremented in response to each delete 
event. Referring to Figure 27, in such an embodiment, hash CAM block 1 would output a 
numerically lower (higher priority ) priority value than hash CAM blocks 2, 3 and 4 due to the 
previously inserted entry. Similarly, if a most-filled insertion policy was used in the example of 
Figure 28, either hash CAM block 2 or 3 would be selected over hash CAM block 4 due to the 
fact that hash CAM blocks 2 and 3 are more filled than hash CAM block 4. Note that hash CAM 
block 1 remains conflicted and therefore unable to store the candidate entry, and that the 
selection between hash CAM blocks 2 and 3 may be determined by a hardwired or 
programmable tie resolution policies. 
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Hash CAM Device having Shared Key/Hash Index Generation Circuitry 
[0079] Figure 29 illustrates an alternative embodiment of a CAM device 450 that includes 
multiple hash CAM blocks 445. In the CAM device 450, the hash CAM blocks 445 are 
organized into G groups of M hash CAM blocks each, with each hash CAM block operating as 
described in reference to the previous figures, and with each group of hash CAM blocks sharing 
a respective assembler/mask circuit 441 i-441g and hash index generator 443i-443g- A group 
configuration register 447i-447 G is associated with each group of hash CAM blocks and used to 
control the operation of the corresponding assembler/mask circuit. Figure 30 illustrates a more 
detailed embodiment of the assembler/mask circuit 441 and hash index generator 443 of Figure 
29. As shown, the assembler/mask circuit 441 includes K assembler/mask subcircuits, 451 1- 
45 Ik, each coupled to provide a respective masked key, 194i-194 K5 to a corresponding one of K 
hash function circuits 453j-453k within the hash function generator 443. Each assembler/mask 
subcircuit includes an assembler subcircuit and key mask logic subcircuit that operate in the 
manner described above in reference to the assembler circuit 191 and key mask logic 193 of 
Figure 6 to generate an assembled, masked key. In one embodiment, the group configuration 
register 447 includes a set of K entry type values that are input, respectively, to the K 
assembler/mask subcircuits 45 lj -45 Ik to enable each assembler/mask subcircuit to be configured 
to output a different masked key based on the same incoming data value. By this arrangement, 
each hash function circuit 453 i-543k may receive a different masked key. Each of the M hash 
CAM blocks within a given group (i.e., 445i-554 M ) includes a configuration circuit that may be 
programmed with a respective hash function select value (HFSEL) and includes a hash function 
select circuit to select, according to the programmed hash function select value, one of the K 
hash indices, HIi-HIk, generated by the hash index generator 443. Accordingly, each hash CAM 
block within the group may receive a hash index which has been generated according to a 
selected entry type, key mask and hash function. Comparing the hash CAM device 450 to the 
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hash CAM device of Figure 15, the total number of hash index generator circuits is reduced by a 
factor of M while retaining independent configurability for each hash CAM block 445. Note that 
in an alternative embodiment, the groups of hash CAM blocks depicted in Figure 29 need not all 
include the same number of CAM blocks. Also, while embodiments of hash CAM blocks are 
described below as including a dedicated assembler circuit, key mask logic and hash index 
generator, such hash CAM blocks may be modified to operate with the shared assembler/mask 
circuit 441 and shared hash index generator 443 of Figures 29 and 30. 

Hash CAM Block with Segmented Memory 
[0080] Figure 3 1 illustrates an embodiment of a hash CAM block 470 that includes a 
segmented memory 471 and which may be used within any of the multiple hash CAM block 
devices or single hash CAM block devices described above. The hash CAM block 470 includes 
an assembler circuit 191, key mask logic 193, hash index generator 131 and block select logic 
131 that operate generally as described above in reference to Figure 1 6. The segmented memory 
471 includes an address decoder 195, read write circuit 481 and segmented memory array 477. 
The address decoder 195 responds to a hash index from the hash index generator 131 by 
activating one of a plurality of word lines 204 in the manner described above in reference to 
Figure 6. Each word line 204 is shared among each of four segments 479i-4794 within the 
segmented memory array 477 (more or fewer segments may be used to form the memory array in 
alternative embodiments), so that the activated word line enables read and write access to a row 
of storage cells that spans all four segments. During a NFA or search operation, the contents 
within the entire row of storage cells (i.e., the contents of the word-line indicated location within 
each of segments 479i-479 4 ) are sensed by a set of four sense amplifier banks (SA) within the 
read/write circuit 481 and output as a set of four entry segments 138i-1384 to an output logic 
circuit 475. During an insert operation, a segment address 478 (SA) is provided to the read/write 
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circuit 481 to select one or more of write driver banks (WD) within the read/write circuit 481 to 
store an entry within the corresponding segment or segments 479. 
[0081] The hash CAM block 470 includes a configuration register that is similar to the 
configuration register 137 described in reference to Figures 6 and 16, except that the entry type 
value within the configuration register includes an entry size field which specifies the size (i.e., 
number of constituent bits) of entries stored within the segmented memory 471 . In the 
embodiment of Figure 3 1 for example, entries which span 1 , 2, or 4 segments may be stored 
within the segmented memory. Such values are referred to herein as xl , x2, and x4 entries and 
correspond to xl 5 x2, and x4 operating modes of the CAM block 470 selected by the entry size 
field. In alternative embodiments that include more or fewer memory segments 479, entries may 
span different numbers of segments. 

[0082] The entry size field within the entry type value (collectively designated an entry 
type/size value in Figure 31 to emphasize the presence of entry size information) is provided to 
the read/write circuit 481 to control which of the segments 479 are accessed in a read or write 
access to the segmented memory array 477. For example, during an insert operation in the xl 
mode, a 2-bit segment address 478 is provided to the read/write circuit 481 to enable one of four 
write driver circuits (WD) within the read/write circuit 481 to store a xl entry in a corresponding 
one of the memory array segments 479. During an insert operation in the x2 mode, the least 
significant bit of the 2-bit segment address 478 is ignored, and the most significant bit of the 
segment address value is used to enable one of two pairs of write driver circuits to store a x2 
entry in segment pair 479i/4792 or 4793/4794, respectively. During an insert operation in the x4 
mode, both bits of the segment address 478 are ignored and all four write driver circuits within 
the read/write circuit 481 are enabled to store a x4 entry in the memory array 477. In all three 
operating modes, the word line activated by the address decoder 195 selects the row of the 
memory array 477 in which the entry is stored. 
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[0083] The entry size information within configuration register 473 is also provided to the 
output logic 475 to control the generation of a block flag signal 474, block priority value 472 and 
block index 476. The output logic 475 is coupled to the sense amplifier banks (S A) within the 
read/write circuit 481 to receive an entry segment 138i-1384 from each of the corresponding 
segments 479 1 -4794 of memory array 471 . Depending on the operating mode selected by the 
entry type/size value within the configuration register 473, the entry segments 138i-138 4 may be 
four distinct entries (xl mode) that each span a single memory segment 479, two distinct entries 
(x2 mode) that each span a pair of memory segments, or a single entry that spans all four 
memory' segments (x4 mode). The output logic 475 also receives the search key 210 from the 
assembler circuit 191, hash index 340 from the hash index generator 131,. block select signal 332 
from the block select logic 331, and NFA signal 212 from an instruction decoder (not shown in 
Figure 31) or other control circuit. 

[0084] Figure 32 illustrates a more detailed embodiment of the output logic 475 of Figure 3 1 
(other embodiments may be used). The output logic 475 includes four compare logic circuits 
135 i-l 354, segment priority logic 491, flag logic 489 and segment index encoder 493. Each of 
the compare logic circuits 135j-1354 receives a respective entry segment 138i-1384 (i.e., from 
memory segments 479i-4794 of Figure 3 1) and operates generally as described in reference to 
Figure 9 to generate a corresponding match/empty signal, referred to herein as a segment flag 
(SF). During a NFA operation, each segment flag, SF1-SF4, indicates the state of the valid bit for 
the corresponding segment, and therefore indicates whether the corresponding segment is 
occupied (i.e., by a valid entry or portion thereof) or unoccupied. During a search operation, 
each segment flag indicates whether the entry key for the corresponding segment (i.e., KEYsi, 
KEYs2, KEYs3, KEYs4) matches the search key 210, ignoring any mismatch between bits 
indicated to be masked by the entry mask information for the segment (i.e., MSK S i, MSK S 2, 
MSKs3, MSKs4). In both NFA and search operations, the segment flags are output to the flag 
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logic circuit 489 which generates the block flag 474 (BF) in accordance with the entry size value 
from the configuration register and the block select signal 332. The segment flags are also 
output to the segment priority logic 491 which generates a block priority value (BP) and a 
plurality of index select signals, IS1-IS4. The index select signals are input to the segment index 
encoder 493 which encodes the index select signals to generate a segment index 488 (SIN). In a 
NFA operation, the segment priority logic 489 selects the segment flags, SF1-SF4, to be output 
as the index select signals, while in a search operation, the entry priority values (PRIsi, PRIs2, 
PRIs3, PRIs4) are qualified by the segment flags and used to generate the index select signals. 
The NFA signal 212 is input to the segment priority logic 491 to select the source of the index 
select signals. 

[0085] Figure 33 illustrates an embodiment of the flag logic 489 of Figure 32 (other 
embodiments may be used). The flag logic 489 receives the entry size information from the 
configuration register (i.e., element 473 of Figure 31) in the form of three component entry size 
signals: xl, x2 and x4. The xl signal is asserted (i.e., to a logic high state in this example) when 
the CAM device is configured for xl mode, the x2 signal is asserted in the x2 mode, and the x4 
signal is asserted in the x4 mode. In xl mode, the asserted xl signal enables the output of OR 
gate 50 1 to pass through AND gate 5 1 1 and OR gate 5 1 7 to a first input of AND gate 519. A 
second input of AND gate 519 is coupled to receive the block select signal 332. Accordingly, in 
the xl mode, if the block select signal 332 is asserted and any one of the segment flags SF1-SF4 
are high (indicating an empty condition within the corresponding segment during a NFA 
operation and a match condition during a search operation), the block flag 474 will be asserted 
by AND gate 519. 

[0086] In the x2 mode, AND gate 5 1 3 is selected to drive the block flag 474 via OR gate 5 1 7 
and AND gate 519. AND gate 513 receives the output of OR gate 507 which receives outputs of 
AND gates 503 and 505. Each of AND gates 503 and 505 receives a respective pair of the 
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segment flag signals, SFi/SF 2 and SF3/SF4, so that an empty (or match) indication is indicated in 
the x2 mode only if SFi and SF2 are both asserted or SF3 and SF4 are both asserted. In the x4 
configuration, AND gate 515 is selected to drive the block flag 474 via OR gate 517 and AND 
gate 519. In the x4 mode, all four segment flags, SF1-SF4, are ANDed together in AND gates 
503, 505 and 509 to generate the signal input to AND gate 515. Thus, in the x4 mode, all four 
segment flags must indicate an empty status in order for the block flag 474 to be asserted during 
a NFA operation, and all four segments flags must indicate a match condition in order for the 
block flag 474 to be asserted during a search operation. In an alternative embodiment, three of 
the four segment flags could be ignored in the x4 mode and one segment flag in each pair of 
segment flags could be ignored in the x2 mode. 

[0087] Figure 34 illustrates an embodiment of the segment priority logic 491 of Figure 32 
(other embodiments may be used). The segment priority logic 491 includes search priority logic 
550, NFA priority logic 571, multiplexer 565 and multiplexer bank 570. The segment priority 
logic 491 receives the x2 and x4 entry size signals from the configuration register and also 
receives the segment flags, SF1-SF4, from the compare logic circuits and the corresponding 
priority values, PRIsi-PRIs4 5 from the segmented memory. As discussed above, the segment 
priority logic 491 outputs the index select signals, IS1-IS4, and also outputs the block priority 
value 472 (BP). The NFA signal 212 is coupled to the control inputs of multiplexers 570 to 
select the source of the index select signals, IS1-IS4. During a NFA operation, the asserted NFA 
signal 212 selects segment flags SF1-SF4 to be output as the index select signals IS1-IS4. For 
example, if segment flag SFj is high (indicating that the corresponding segment within the 
segmented memory is empty), index select signal ISj will also be high. By contrast, during a 
search operation, the NFA signal 212 is deasserted and search priority logic 550 is selected (i.e., 
by multiplexer bank 570) to drive the index select signals. Multiplexer 565 is responsive to the 
NFA signal 212 to select an insert priority value 574 (IP) from the NFA priority logic 571 to be 
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output as the block priority value 472 during a NFA operation, and to select a search priority 
value 564 (SP) from the search priority logic 550 to be output during a search operation (i.e., 
when the NFA signal 212 is deasserted). 

[0088] The search priority logic 550 includes logic OR gates 541 , 543, 545 and 547 to generate 
match-qualified priority values; comparator circuits C u C2 and C3 to compare the match 
qualified priorities; multiplexers 549, 551 and 553 to select a search priority value from among 
the match qualified priority values; and AND gates 555, 557, 559 and 561 to output the index 
select signals IS1-IS4 via multiplexer bank 570. Each of the OR gates 541, 543, 545, 547 outputs 
a respective match qualified priority QPsi, QPs2, QPs3, QPs4 according to the state of the 
corresponding segment flags SF r SF 4 . For example, if segment flag SFi is high (indicating a 
match condition), OR gate 541 will output the PRI S i value as the match-qualified priority value 
QPsi. If SFj is low, indicating a mismatch condition, OR gate 541 will output a match 
disqualified priority value QPsi in which all constituent bits are high (i.e., due to the inversion of 
the segment flag at the input to OR gate 541), which, in the embodiment of Figure 34, is the 
lowest possible priority value. OR gates 543, 545 and 547 are additionally coupled to receive the 
x4 signal so that, in the x4 mode, the priority values QPs2> QPs3 and QPs4 are driven to the 
lowest possible priority value (i.e., disqualified because they are unused). Consequently, in the 
x4 configuration, the match qualified priority values for segments 2-4 of the segmented memory 
will not exceed the match qualified priority value for segment 1 and therefore are effectively 
ignored. That is, the segment 1 priority value, PRIsi, is the priority value for a x4 entry. In 
alternative embodiments, any of others of the segments, including combinations of two or more 
segments may be used to source the x4 priority value. OR gates 543 and 547 are further coupled 
to receive the x2 signal so that, in the x2 mode, the priority values for segments 2 and 4 are 
ignored, the x2 entries being represented by priority values stored in segments 1 and 3. In 
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alternative embodiments, others of the segments, including combinations of segments may be 
used to source the x2 priority values and/or the x4 priority value. 

[0089] Comparator C\ compares priority values QPsi and QPs2 and generates a select signal 
552 having a high logic state if priority value QPsi indicates a higher priority (or equal priority) 
than priority value QP S 2, and a low state if priority value QP S i indicates a lower priority than 
priority value QPs2- That is, select signal 552 is high or low according to whether priority value 
QPsi or QPs2, respectively, is the winner of the priority comparison in comparator C\. The select 
signal 552 is input to multiplexer 549 to select the Ci comparison winner (i.e., priority value 
QPsi or QPs2) to be output to multiplexer 553 and comparator C 3 . Similarly comparator C 2 
compares the priority values QP S 3 and QP S 4 and generates a select signal 554 having a high or 
low state according to whether priority value QPs3 or QP S 4 is the comparison winner. The select 
signal 554 is input to multiplexer 551 to select the C2 comparison winner (i.e., priority value 
QP S 3 or QPs4) to be output to multiplexer 553 and comparator C 3 . Comparator C 3 compares the 
priority numbers output by multiplexers 449 and 551 and generates a select signal 556 having a 
high or low state according to whether the priority value from multiplexer 449 or 55 1 is the 
comparison winner. The select signal 556 is provided to a select input of multiplexer 553 to 
select the C3 winner to be output as the search priority value 564 (SP). The select signals 552, 
554 and 556 are also input to AND gates 555, 557, 559 and 561 to enable one of the AND gates 
to assert a high logic signal according to which priority value, QPsi, QPs2, QPs3 or QPs4, is the 
overall priority comparison winner. Thus, if select signals 552 and 556 are both high, then the 
output of AND gate 555 is high to indicate that QPsi is the priority comparison winner. 
Similarly, if select signal 552 is low and 556 high, the output of AND gate 557 is high to indicate 
that QPs2 is the priority comparison winner; if select signal 556 is low and 554 high, the output 
of AND gate 559 is high to indicate that QP S 3 is the priority comparison winner; and if select 
signal 556 and 554 are both low, the output of AND gate 561 is high to indicate that QP S 4 is the 



Atty. Docket No. NLMI.P1 12C 



-52- 



priority comparison winner. Thus, when in the xl configuration, the memory segment that 
sources the highest priority match-qualified priority value will cause the corresponding index 
select signal, ISi-IS 4? to be asserted. 

[0090] NFA priority logic 571 is provided to generate the insert priority value 574. As 
discussed above, the insert priority value 574 may be compared with insert priority values from 
other hash CAM blocks during a NFA operation to select a hash CAM block for entry insertion. 
In the embodiment of Figure 34, the NFA priority logic 571 includes a multiplexer 572 to select 
between two predetermined priority values according to whether the indexed row within the 
segmented memory contains at least one occupied segment. More specifically, if the indexed 
row includes at least one occupied segment (indicated by a logic low segment flag), the output of 
AND gate 573 will be low, thereby selecting a first predetermined priority value (zero in this 
example) to be output as the insert priority value 574. Conversely, if all the segment flags 
indicate empty, an empty row within the segmented memory has been selected by a hash index 
and second predetermined priority value of (100H in this example, the 'H' indicating 
hexadecimal notation) is selected to be output as the insert priority value 574. In one 
embodiment, the first predetermined priority value has a higher priority than the second 
predetermined priority so that, a hash CAM block which is indexed at a partially filled row of a 
segmented memory will output a higher insert priority value than a hash CAM block which is 
indexed at a completely empty row. (Note that the NFA priority logic 571 need not account for a 
completely filled row because the block flag 474 will not be asserted in that circumstance). 
Different insertion policies may be used in alternative embodiments, however it is generally 
desirable to insert entries into partially filled rows within the segmented memory whenever 
possible in order to use the memory efficiently. 

[0091] Figure 35 illustrates alternative NFA priority logic 580 that may be used in place of 
NFA priority logic 571 of Figure 34. The NFA priority logic 580 generates an insert priority 
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value 582 having an R-bit fill count component 586 and at least one bit 588, referred to herein as 
a partial fill bit, that indicates whether or not at least one segment of the indexed memory row is 
occupied by at least one valid entry (i.e., a partially filled row). The R-bit fill count 586 is 
generated by a fill counter 585 which is incremented in response to an insert operation and 
decremented in response to a delete operation (i.e., an operation to invalidate an entry). 
Accordingly, the fill count 586 indicates the number of valid entries stored in the memory of a 
hash CAM block such that, in an embodiment in which numeric value is inversely proportional 
to priority, the less filled of two CAM blocks having matching partial fill bits 588 will output a 
higher priority value than the more filled CAM block. In the embodiment of Figure 35, insert 
and delete signals (INS and DEL) are applied, respectively, to up and down inputs of the fill 
counter 585, the insert and delete signals being generated, for example, by an instruction 
decoder. In alternate embodiment, the fill counter may count down in response to the insert 
signal and up in response to the delete signal so that the less filled of two CAM blocks having 
matching partial fill bits 588 will output a numerically higher (lower priority) priority value than 
the more filled CAM block. 

[0092] The partial fill bit 588 is generated by AND gate 573 according to the states of the 
segment flags SF1-SF4. Specifically, if any of the segment flags indicates an occupied segment 
(i.e., segment flag is low), the output of AND gate 573 goes low to produce a low partial fill bit 
588. Conversely, if all the segment flags indicate unoccupied segments (i.e., all segments flags 
are high), the partial fill bit 588 is high. Thus, in the exemplary embodiment of Figure 35, the 
partial fill bit 588 is 0 in the case of a partially (or completely) filled memory row, and 1 in the 
case of a completely unfilled memory row. In one embodiment, the partial fill bit is the most 
significant bit of the insert priority value 582 (or at least more significant than the fill count 586) 
so that a CAM block containing a partially filled row will output a numerically lower (i.e., higher 
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priority) insert priority value than a CAM block containing no valid entries within the indexed 
row (i.e., a completely unfilled row), regardless of the relative fill levels of the two CAM blocks. 
[0093] Figure 36 illustrates an embodiment of the segment index encoder 493 of Figure 32. 
The segment index encoder 493 receives the index select signals, IS1-IS4, from the segment 
priority logic and selects one of four predetermined segment indices, 00b, 01b, 10b or 1 lb (the 
'b' indicating binary notation) to be output as the segment index 488. Each of the predetermined 
indices is an address to one of the four different segments within the segmented memory. Thus, 
if during either a NFA operation or a search operation, index select signal IS| is asserted, value 
00 at port PI of a multiplexer 581 is output as the segment index 488. That is, if segment 1 
within the segmented memory is signaled to be unoccupied in a NFA operation, or signaled to 
contain a matching entry in a search operation, the address of that segment (i.e., 00b) will be 
output as the segment index 488. In one embodiment, the multiplexer 581 is designed to resolve 
ties (i.e., more than one index select signal being asserted) in favor of the lowest numbered of the 
index select signal. For example, if all four index select signals are asserted, the segment index 
that corresponds to index select signal ISi (i.e., 00b) is output as the segment index 488. 
Different tie resolution schemes may be used in alternative embodiments. 
[0094] Figure 37 illustrates a NFA operation within an exemplary CAM device 592 having 
four hash CAM blocks, 470i-4704, each of which includes a four-segment memory, and each of 
which outputs one of two insert priority values according to whether an indexed memory row is 
partially filled (i.e., each hash CAM block includes NFA priority logic 571 of Figure 34). For 
simplicity, each hash CAM block 470i-470 4 is depicted as having six memory storage rows 
(each with four segments), with hash CAM blocks 1 and 2 (470i, 470 2 ) programmed to select the 
hash index generated by a first hash function circuit 431 and hash CAM blocks 3 and 4 (470 3 , 
470 4 ) programmed to select a hash index generated by a second hash function circuit 433. Thus, 
for a given masked key value (MKEY), the hash indices generated within hash CAM blocks 1 
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and 2 will be the same (i.e., index 340), and the hash indices generated within CAM blocks 3 and 
4 will be the same (i.e., index 341), but, except in cases of coincidence, index 340 will be 
different from index 341 . Also for simplicity, the key mask values and entry type values 
programmed within the four hash CAM blocks are assumed to be the same (i.e., all four hash 
CAM blocks are in the same insertion pool). 

[0095] In the example of Figure 37, hash CAM block 1 (470i) includes five entries each stored 
within segment 1 (indicated by cross-hatched shading). Hash CAM blocks 2, 3 and 4 are each 
completely empty. Because hash CAM block 1 is indexed at a partially filled row, hash CAM 
block 1 will generate a numerically lower (higher priority) insert priority value than hash CAM 
blocks 2, 3 and 4. Thus, in Figure 37, hash CAM block 1 is selected for entry insertion instead 
of hash CAM blocks 2, 3 and 4. The insertion location is indicated by lined shading (i.e., at row 
2, segment 2 of hash CAM block 1). 

[0096] Figure 38 illustrates another exemplary NFA operation within the hash CAM device 
592 of Figure 37 after additional entries have been loaded into the hash CAM blocks 1, 2 and 3. 
Hash CAM block 4 is still completely empty. In this example, hash CAM blocks 1 and 2 are 
conflicted (i.e., the rows at the indexed location are fully loaded), and therefore will not assert a 
block flag signal. Hash CAM blocks 3 and 4, by contrast, both have completely empty rows at 
the indexed location. Because hash CAM blocks 3 and 4 both output the same priority number 
(the relative fill levels of the CAM blocks not being incorporated into the insert priority values), 
hash CAM block 3 is selected to store the entry according to a hardwired tie resolution policy 
that favors lower numbered hash CAM blocks. Different tie resolution policies may be 
implemented in alternative embodiments. 

[0097] Figures 39 and 40 illustrate NFA operations within a hash CAM device 594 that is 
identical to the exemplary hash CAM device 592 of Figures 37 and 38 except that the hash CAM 
device 594 includes the capacity-dependent NFA priority logic 580 of Figure 35. Referring 
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specifically to Figure 39, hash CAM blocks 1, 2, 3 and 4 are all indexed at completely unfilled 
rows so that that the partial fill bit of the insert priority value for each hash CAM block is set. 
Because the fill level of hash CAM block 2 is lower than that of hash CAM blocks 1, 3 and 4, 
however (hash CAM block 1 contains only one entry, while hash CAM blocks 2, 3 and 4 each 
contain more than one entry), hash CAM block 2 outputs a numerically lower (higher priority) 
priority value than hash CAM blocks 1, 3 and 4 and is therefore selected for entry insertion. 
[0098] In the example of Figure 40, only hash CAM blocks 1, 2 and 4 are able to store the data 
value within the indexed row (hash CAM block 3 is conflicted). Hash CAM block 2 is the least 
filled of hash CAM blocks 1,2 and 4, but is indexed at a completely empty row. Consequently, 
the most significant bit of the insert priority value output by hash CAM block 2 will be set. By 
contrast, hash CAM blocks 1 and 4 are indexed at partially filled rows and therefore will have a 
cleared most significant bit within their insert priority values. Thus, the insert priority values 
output by hash CAM blocks 1 and 4, will be numerically lower (higher priority) than the insert 
priority value output by hash CAM block 2, even though hash CAM block 2 is less filled than 
hash CAM blocks 1 and 4. Hash CAM blocks 1 and 4 will output equal priority values because 
each contains the same number of at least partly occupied rows (i.e., fill counter 585 of Figure 35 
will have been incremented each time an unfilled row within the binary CAM was loaded with a 
new hash index to enable storage of an entry within an unfilled row of the corresponding 
memory ). Because hash CAM blocks 1 and 4 both output the same priority number, hash CAM 
block 1 is selected to store the entry according to a hardwired tie resolution policy that favors 
lower numbered hash CAM blocks. Different tie resolution policies may be implemented in 
alternative embodiments. 

[0099] Although Figures 39 and 40 illustrate NFA examples using a least-filled insertion 
policy, a most-filled insertion policy (i.e., policy which favors a more-filled rather than a less 
filled hash CAM block for entry insertion purposes) may be applied instead. A most-filled 
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insertion policy may be implemented, for example, by using an empty counter rather than the fill 
counter 585 within the NFA priority logic of Figure 35. The empty counter may be preloaded to 
a predetermined value, then decremented in response to each binary CAM load operation (i.e., 
indicated by assertion of the INS signal) and incremented in response to each binary CAM delete 
operation (i.e., indicated by assertion of the DEL signal). Referring to Figure 39, in such an 
embodiment, hash CAM block 1 would output a numerically lower (higher priority ) priority 
value than hash CAM blocks 2, 3 and 4 because more of its rows are filled or at least partially 
filled (i.e., hash CAM block 4 has an occupied row count of 4, while hash CAM block 2 has an 
occupied row count of 2, and hash CAM blocks 2 and 4 each have an occupied row count of 1) . 

Indirect Hashing using Binary CAM 
[0100] Figure 41 illustrates another embodiment of a hash CAM block 600 which may be used 
within any of the multiple hash block CAM devices or single hash CAM block devices described 
above. The hash CAM block 600 includes an assembler circuit 191, key mask logic 193, hash 
index generator 131, block select logic 331 and configuration register 473, all of which operate 
as generally described above in reference to Figure 3 1 . The hash CAM block 600 additionally 
includes a binary CAM circuit 601 to perform a function referred to herein as indirect hashing. 
In indirect hashing, in the hash CAM block 600, the hash index 340 generated by the hash index 
generator 131 is not used to index the memory directly, but rather is compared with stored hash 
indices within the binary CAM 601 to identify a matching hash index within a storage row of the 
binary CAM 601, and to select a corresponding row within a memory array 477. 
[0101] Figure 42 illustrates the different results produced by direct hashing, in which a hash 
index is used to address the memory array 477 directly, and indirect hashing achieved through 
use of the binary CAM 601. As shown, the hash index generator 131 generates a hash index 340 
having N constituent bits. Because the memory array 477 only has 2 M storage rows, however, 
only M of the N bits within hash index 340 may be used to directly address the memory array 
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477. That this, in a direct hashing scheme, either the hash index generator 131 generates a hash 
index having a number of bits that corresponds to the size of the memory to be addressed, or the 
hash index is truncated to the appropriate number of bits before addressing the memory array. 
The truncation of T bits from the N-bit hash index is shown in the direct hashing example of 
Figure 42 (i.e., N=M+T). The truncation of T bits reduces the overall hashing space by a factor 
of 2 T , thereby increasing the likelihood of collisions relative to indirect hashing. By contrast, in 
the indirect hashing approach, the full set of N bits may be used (or at least more than M bits) to 
select a row of the memory array 477, even though the memory array 477 has fewer than 2 N 
storage rows. As shown, the binary CAM 601 is used to store the full N-bit hash index within a 
selected row of CAM cells during an insert operation, the selected row of CAM cells 
corresponding to a row within the memory array in which the entry is inserted. In a search 
operation, the hash index generated by the hash index generator 13 1 is compared with each of 
the hash indices stored within the binary CAM 601 to determine whether the hash index has 
previously been loaded in to the binary CAM 601 . If a match is found, a match line 
corresponding to the row of the binary CAM 601 containing the matching hash index is 
activated. In one embodiment, the match line is used to drive a corresponding word line within 
the memory array. By this arrangement, a matching hash index within the binary CAM enables 
read or write access to the corresponding storage row within the memory array 477. Thus, the 
binary CAM effectively presents a larger hash index space to the address generation circuitry 
(i.e., the hash index generator 131) than is provided by the underlying memory array 477. 
Because the full hash index 340 may be used (that is, rather than a truncated hash index), the 
likelihood of collisions between incoming data values is substantially reduced (i.e., as compared 
with direct hashing). Further, as shown in Figure 42 the binary CAM may be loaded a 
predetermined manner (e.g., bottom to top, top to bottom, etc.) such that the full space within the 
memory array may be used. This is in contrast to the direct hashing approach, in which the 
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memory array may be filled only up to a certain percentage before collisions become so 
statistically likely that the hash CAM block is, for practical purposes, full. 
[0102] In one embodiment, the hash index generator 131 generates a 16-bit hash index (a CRC 
value), the memory array contains 512 rows (i:e., addressable by a log2(512) = 9 bit address), and 
the binary CAM stores 12 bits of the hash index, thus truncating the hash index by four, but not 
all seven extra bits. In an alternative embodiment, a ternary CAM may be used to store the full 
hash index together with mask information to mask out selected bits of the hash index. For 
example, a ternary CAM having a 16-bit storage width may be used to store a full 16-bit hash 
index, with any number of the hash index bits masked to produce, in effect, a smaller hash index. 
The hash index bits to be masked, if any, may be configured by a value stored within the 
configuration register of the hash CAM block. By this arrangement, different hash CAM blocks 
may be configured to use unmasked portions of a hash index to further reduce the likelihood of 
collisions in the CAM device. It should be noted that not all the CAM cells within such a ternary 
CAM device (i.e., ternary CAM device used to support indirect hashing) need to be ternary CAM 
cells. That is, each row within the ternary CAM device could include a number of binary CAM 
cells and a number of ternary CAM cells according to application needs. 

[0103] In applications in which insertion policy favors insertion at partially filled rows, it may 
be desirable to increase the likelihood of a match within the binary CAM array 625 by masking 
selected bits of the hash index, effectively selecting a smaller hash index. In the embodiment of 
Figure 43, a hash index mask circuit 641 is provided to mask selected bits of the hash index 340 
in accordance with the entry size value within the block configuration register. As a specific 
example, for a xl configuration, a truncated, 12-bit hash index 340 is output from the hash index 
generator 131 and four of the bits of the hash index 340 are masked by the hash index mask 
circuit 641 hash for xl entries; two bits of the hash index 340 are masked by the hash index mask 
circuit 641 for x2 entries, and none of the bits of the hash index 340 are masked by the hash 
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index mask circuit 641 for x4 entries. This hash index masking policy reflects the significance 
of partially filled rows for the xl, x2 and x4 configurations. That is, there can be no partially 
filled row in the x4 configuration (row is either entirely occupied or unoccupied), and partially 
filled rows are statistically more likely in the xl configuration than the x2 configuration due to 
the fact that a partially filled row remains partially filled in the xl configuration until it has been 
selected for entry insertion four times (versus two times for the x2 configuration). Different 
masking levels may be used in alternative embodiments, including embodiments having more or 
fewer entry segments and therefore different configuration in which partially filled rows are 
possible. Also, different hash index truncation selections may be enabled in different hash CAM 
blocks within a given device. For example, in one embodiment some of the hash CAM blocks 
include memory arrays (and binary CAM arrays) having 512 storage rows, while others of the 
hash CAM blocks include memory arrays (and binary CAM arrays) having 256 storage rows. 
Different hash index truncation selections may be programmed for the different hash CAM 
blocks according to the total addressable storage space (and therefore the storage capacity) of the 
memory array. 

[0104] In an alternative embodiment, the hash index mask circuit 641 may be omitted and a 
ternary CAM array may be used in place of the binary CAM array 625. Mask values may then 
be stored with each hash index 340 loaded into the ternary CAM to effect the desired level of 
hash index masking. In another alternative embodiment, the hash index 340 is not masked for 
any of the entry size configurations (or to achieve truncation) so that the hash index mask circuit 
641 may be omitted altogether. 

[0105] Returning to Figure 41 , the binary CAM 601 is coupled to receive a hash index 340 
from hash index generator 131 and outputs a decoded row address 612 (DCRA) and a set of 
binary CAM signals 610 (BCMS) to a multiplexer 607. The multiplexer 607 is responsive to an 
operation select signal (OPSEL) to select either the decoded row address 612 or the binary CAM 
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match signals 610 to drive the word lines of the memory array 477. For example, during a NFA 
operation or search operation, the binary CAM match signals 610 are selected by the multiplexer 
607 to drive the word lines of the memory array 477. Accordingly, if a matching hash index is 
detected within the binary CAM 601 during an NFA or search operation, the asserted binary 
CAM match signal enables the corresponding row of the memory array 477 to be sensed by the 
sense amplifiers banks within the read write circuit 479 and provided to the output logic 609. 
During an insert operation (or other write operation), a row address 602 (RA) output from an 
address selector 605 is decoded within the binary CAM 601 to generate the decoded row address 
612. If the insert operation involves loading a new hash index into the binary CAM 601, the 
hash index is loaded into the binary CAM row indicated by the decoded row address 612. If the 
insert operation involves loading a new entry into a partially filled row of the memory array (that 
is, a row for which a corresponding hash index is already stored in the binary CAM 601), a hash 
index is not stored in the binary CAM 601. In either case, the decoded row address 612 is 
selected by the multiplexer 607 to activate the corresponding word line within the memory array 
477 and thereby enable an entry to be stored within the indicated storage row. 
[0106] The binary CAM 601 also outputs a binary CAM index 608 (BCIN) and binary CAM 
flag 606 (BCF) to the output logic 609 where they are used to generate a block index 620 (BIN), 
block priority value 618 (BP) and block flag 616 (BF). The block index 620 is fed back to the 
address selector 605 to update selected address registers therein. It should be noted that, instead 
of outputting a decoded address to the memory array 477, a separate address decoder may be 
provided to receive an encoded address from binary CAM 601, and to decode the encoded 
address to activate the indicated word line within the memory array 477. In such an 
embodiment, the multiplexer 607 may receive the row address 602 output by the address selector 
(i.e., instead of the decoded row address 612) and the binary CAM index 608 output by the 
binary CAM 601 (i.e., instead of the binary CAM match signals). During an NFA or search 
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operation, the binary CAM index 608 is selected to address the memory array 477, and during 
other operations (e.g., an insert operation), the row address 602 is selected to access the memory 
array 477. 

[0107] In the embodiment of Figure 41, the memory array 477 is a segmented array that 
operates in the manner described above in reference to Figure 3 1 . Accordingly, the address 
selector 605, in addition to outputting the row address 602 to the binary CAM 601 , also outputs 
a segment address 604 (SA) to the read write circuit 479. The segment address is used to select 
one or more write driver or sense amplifier banks within the read/write circuit 479 to perform a 
write or read at the indicated segment(s) of the word-line selected row. As in the embodiment of 
Figure 31, entry size information stored within the configuration register 473 is provided to the 
read/write circuit to control the number of memory segments accessed during a given insert, 
search or NFA operation. In an alternative embodiment, an unsegmented memory array (i.e., 
memory containing a single segment) may be used in place of memory array 477. Also, while 
memory array 477 is described below as having four segments and corresponding numbers of 
supporting circuits, more or fewer segments may be included within the memory array 477 in 
alternative embodiments. Further, in a single segment embodiment, compare logic (e.g., element 
135 of Figure 6) may be used in place of the output logic 609. 

[0108] Figure 43 illustrates an embodiment of the binary CAM 601 and the address selector 
605 of Figure 41. The binary CAM 601 includes an address decoder 629, binary CAM array 
625, binary CAM read/write circuit 627, binary CAM flag logic 631, and binary CAM priority 
encoder 633. The address decoder 629 is coupled to receive the row address 602 from the 
address selector 605, and decodes the row address to activate an indicated one of word lines 624. 
The signals carried on word lines 624 constitute the decoded row address 612 and are output 
from the binary CAM 601 as shown. The binary CAM array 625 includes a plurality of rows of 
CAM cells each including a storage element and a compare circuit to enable an input hash index 
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340 to be simultaneously compared with the contents of each binary CAM row. The binary 
CAM read/write circuit 627 is used to store a hash index 340 in the binary CAM array 625 
during a binary CAM load operation. The binary CAM read/write circuit 627 may also be used 
to read the contents of the binary CAM array 625, for example, for test purposes. During a NFA 
operation or a search operation, a hash index 340 is compared with the contents of each row of 
CAM cells within the binary CAM array 625 to generate a plurality of binary CAM match 
signals 610. Each binary CAM match signal indicates whether the contents of the corresponding 
binary CAM row matches the hash index, and is output via a respective one of the binary CAM 
match lines 626 (ML). The binary CAM flag logic 63 1 and the binary CAM priority encoder 
633 are each coupled to the binary CAM match lines 626 to receive the binary CAM match 
signals 610. The binary CAM flag logic 63 1 outputs the binary CAM flag 606 (BCF) according 
to whether a match is detected within the binary CAM 601 during a search or NFA operation. 
The binary CAM priority encoder 633 outputs the binary CAM index 608 (BCIN) according to 
which, if any, of the binary CAM match signals 610 is asserted. Thus, the binary CAM index 
608 is a row address that corresponds to a row of CAM cells within the binary CAM array 625 
that contain a hash index that matches the input hash index 340. In one embodiment, the binary 
CAM priority encoder 633 is a multiplexer circuit that selects one of a plurality of row address 
values according to which of the binary CAM match signals 610, if any, is asserted during a 
binary CAM search operation. In an alternative embodiment, the binary CAM priority encoder 
633 is a lookup table (e.g., a read only memory) that is indexed by binary CAM match signals 
such that an asserted one of the binary CAM match signals 610 selects a corresponding row 
address value to be output from the lookup table as the binary CAM index 608. Note that, in an 
embodiment of the binary CAM in which only one of binary CAM match signals 610 is asserted 
for a given search (i.e., no multiple match) the priority encoder 633 may be replaced by an 
encoder circuit that encodes the asserted one of the match signals 610 into a row address. The 



Atty. Docket No. NLMLP1 12C 



-64- 



encoder circuit may be implemented, for example, by a multiplexer circuit or a lookup table as 
discussed above. 

[0109] During a load operation in the binary CAM 601, a hash index 340 is stored within a 
selected row of CAM cells within the binary CAM array 625, and validity values for each of the 
CAM rows are output on the match lines to indicate whether the corresponding row of CAM 
cells within the binary CAM array 625 are occupied. The binary CAM flag logic 631 generates 
the binary CAM flag 606 according to the state of the validity values. If the validity values 
indicate that all the rows of the binary CAM array 625 are occupied by valid hash indices, the 
binary CAM array 625 is full, and the binary CAM flag 606 is deasserted to indicate the full 
condition. The binary CAM priority encoder 633 also operates on the validity signals to generate 
a binary CAM index 608 that is indicative of a next free address within the binary CAM array 
625. That is, the binary CAM priority encoder 633 generates an address of the highest priority 
location within the binary CAM array 625 indicated to be unoccupied by a corresponding 
validity signal. 

[0110] In one embodiment, the binary CAM index 608 generated during a binary CAM load 
operation is buffered in a next free binary CAM address (NFBA) register 63 1 within the address 
selector 605. As discussed below, if no match is detected within the binary CAM array 625 
during a subsequent NFA operation, the address value within the NFBA register 63 1 is selected 
by multiplexer 639 to be output as row address 602 and is selected by output logic 609 of Figure 
41 to be a row index portion of the block index 620. 

[0111] The address selector 605 additionally includes a partially filled row (PFR) register 633, 
a highest priority match (HPM) register 635 and an address counter 637 (CNTR), all of which 
are used to store address values. The partially filled row register 633 is used to store the block 
index 620 generating during a NFA operation that indexes a partially filled row (i.e., as opposed 
to a NFA operation in which no match is detected in the binary CAM 601). The HPM register 
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635 is used to store the block index 620 generated during a search operation that results in a 
match detection. Address counter 637 may be selected by multiplexer 639 to provide a sequence 
of ascending or descending indices to the binary CAM 601 and memory array 477, for example, 
to enable the binary CAM 601 and memory array 477 to be sequentially loaded with entries (e.g., 
during system startup). In alternative embodiments more or fewer address sources may be 
included within and/or selected by the address selector 605. For example, an error address 
register or test address generator may be included within the address selector 605 and selected by 
the multiplexer 639 to provide a sequence of error check addresses to the binary CAM 601 
and/or memory array 477. In the embodiment of Figure 43, the multiplexer 639 is responsive to 
address select information 632 (ASEL) from an instruction decoder to select one of the address 
sources. In one embodiment, an address bus 230 is also coupled to a input port of the 
multiplexer 639 to enable externally generated addresses to be selected to address the binary 
CAM array 601 and/or the memory array 477. The address selector 605 outputs the row address 
portion 602 (RA) of the selected address to the address decoder 629 within the binary CAM 601 
and also to the output logic 609 of Figure 41. The address selector 605 also outputs a segment 
address portion 604 (SA) of the selected address to the read/write circuit 479 of Figure 41. As 
discussed above, the segment address 604 is used to control access to individual segments within 
the memory array during insert, read and other memory access operations. 
[0112] Figure 44 illustrates an embodiment of the output logic 609 of Figure 41 (other 
embodiments may be used). The output logic 609 includes compare logic circuits 135i-135 4 , 
flag logic 645, segment priority logic 647 and segment index encoder 649, all of which perform 
the same general functions as corresponding circuit blocks within the output logic 475 of Figure 
32. Also, like the output logic 475 of Figure 32, the output logic 609 receives the segment 
entries 138i-138 4 , search key 210, block select signal 332 and NFA signal 212 as inputs. The 
output logic 609 additionally receives the binary CAM flag 606 (BCF), binary CAM index 608 
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(BCIN) and row address 602 (RA) as inputs. The binary CAM flag 606 is input to the segment 
priority logic 647 and also to the flag logic 645 and segment index encoder 649. The binary 
CAM index 608 and row address 602 are alternatively selected to be the row index component 
652 of the block index 620 depending upon whether the binary CAM flag 606 is asserted. 
During a NFA operation, the next free binary CAM address is selected (e.g., from the NFBA 
register 631 within the address selector 605 of Figure 43) to drive the row address 602 so that the 
multiplexer 651 effectively selects either the binary CAM index 608 or the next free binary 
CAM address to be the row index 652. Thus, if during a NFA operation, no match is found 
within the binary CAM (meaning that the hash index has not previously been stored in the binary 
CAM), the binary CAM flag 606 is not asserted and the multiplexer 651 selects the next free 
binary CAM address to be the row index 652. If the binary CAM flag is asserted during the 
NFA operation (meaning that the hash index has previously been stored within the binary CAM), 
the binary CAM index 608 is indicative of the location of a matching hash index within the 
binary CAM and is selected to be the row index 652. 

[0113] Still referring to Figure 44, the compare logic circuits 135i-1354 operate as described 
above in reference to Figure 32 to generate segment flags SF1-SF4. Each segment flag 
corresponds to a respective segment within the segmented memory and indicates, during a NFA 
operation, an empty/occupied status of the segment. During a search operation, each segment 
flag indicates whether the content of the corresponding segment matches the search key 210 (or 
portion thereof). 

[0114] Figure 45 illustrates the flag logic 645 of Figure 44 according to one embodiment. The 
flag logic 645 receives the segment flags SF1-SF4 and entry size signals, xl, x2 and x4 as inputs 
includes the logic gates 501, 503, 505, 507, 509, 51 1, 513, 515 and 517 described above in 
reference to Figure 33. The flag logic additionally receives the binary CAM flag 606 and NFA 
signal 212 and includes a storage element 667 (a D-flip flop in this example) to store a buffered 
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binary CAM flag 662 in response to a write strobe signal 660 (WSTB). Logic gates 501 , 51 1 
and 517 operate as described in reference to Figure 33 to output a logic high signal if the xl 
signal is high (i.e., xl mode selected) and any of segment flags SF1-SF4 are high (indicating a 
match detection during a search operation, or detection of an unoccupied segment during a NFA 
operation). Because the segment flags are only meaningful if a binary CAM match was detected 
during a search or NFA operation (otherwise, no match signal was asserted by the binary CAM 
to select an entry to be input to the output logic 609) and if the CAM block is allocated to the 
entry type pool or insertion pool indicated by the corresponding search or NFA instruction, the 
output of logic OR gate 517 is ANDed with the binary CAM match flag 606 and the block select 
signal in AND gate 661 . The output of AND gate 661 , if high, will cause OR gate 663 to assert 
the block flag 616. Logic gates 503, 505, 507, 513 and 517 also operate as described in 
reference to Figure 33 to output a logic high signal if the x2 signal is high and either pair of 
segment flags SFi/SF 2 or SF3/SF4 indicate a x2 match/empty condition, and logic gate 503, 505, 
507, 509 and 515 operate as described in reference to Figure 33 to output a logic high if the x4 
signal is high and all four segment flags SF1-SF4 are asserted (i.e., to indicate a x4 match/empty 
condition). Because the output of OR gate 517 is ANDed with the block select signal 332 and 
the binary CAM flag 606, as in the xl mode, the block flag 616 will be asserted in response to a 
match/empty detection in the x2 and x4 modes only if the block select signal 332 is high and a 
match is detected in the binary CAM (i.e., binary CAM flag 606 high). 

[0115] The flag logic 645 includes an additional path for assertion of the block flag 616 in the 
event that the binary CAM flag 606 is not asserted during a NFA operation, but the binary CAM 
is not full. That is, if the binary CAM flag 606 is not asserted during the NFA operation then no 
matching index was located within the binary CAM; a result indicating that the hash CAM block 
is not conflicted. Accordingly, so long as the binary CAM is not full, the new hash index (i.e., 
the storage index generated in response to the entry candidate) may be stored in the next free 
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address of the binary CAM and the entry candidate may be stored in the corresponding row of 
the memory. In the embodiment of Figure 45, the write strobe signal 660 is asserted (e.g., by an 
instruction decoder) during a binary CAM load operation; an operation during which the binary 
CAM flag 606 is low if the binary CAM is full and high if the binary CAM is not full. Thus, the 
buffered binary CAM flag 662 indicates whether the binary CAM is full. The buffered binary 
CAM flag 662, block select signal, and NFA signals 212 are supplied to non-inverting inputs of 
AND gate 665, and the binary CAM flag 606 is supplied to an inverting input of AND gate 665. 
Consequently, AND gate 665 outputs a high signal to OR gate 663 (resulting in assertion of the 
block flag 616) if the hash CAM block is selected to participate in a NFA operation (block select 
signal 332 and NFA signal 212 are high) in which no match is detected in the binary CAM 
(binary CAM flag 606 is low) and the binary CAM is not full (buffered binary CAM flag 662 is 
high). 

[0116] Figure 46 illustrates a NFA priority logic embodiment 670 that may be included within 
the segment priority logic 647 of Figure 44 (other embodiments may be used). The NFA priority 
logic 670 generates an insert priority value 672 in the manner described in reference to the NFA 
priority logic 580 of Figure 35, except that OR gate 671 is provided to force partial fill bit 674 to 
a logic high state (thereby indicating a completely empty row) if the binary CAM flag 606 is not 
high. That is, if no match is detected in the binary CAM during a NFA operation, an ensuing 
entry insertion within the hash CAM block (i.e., if the hash CAM block is selected for entry 
insertion) will occur in a completely unfilled row. If a match is detected in the binary CAM, the 
binary CAM flag 606 will be high to enable AND gate 573 to control the state of the partial fill 
bit 674. AND gate 573 will drive the partial fill bit 674 high if any of the segment flags SF r SF 4 
indicate an empty segment. 

[0117] The fill counter 585 within priority logic 670 is incremented in response to a binary 
CAM load (indicated by a BCWR signal) and decremented in response a binary CAM delete 
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(indicated by a BCDEL signal). By this arrangement, the fill counter 585 indicates the number 
of rows within the memory array which are at least partially filled. As discussed above, different 
fill measures may be used in alternative embodiments, and the fill count may be omitted from the 
insert priority value 672 altogether. 

Figure 47 illustrates an embodiment of the segment index encoder 649 of Figure 44 (other 
embodiments may be used). The segment index encoder 649 operates similarly to the segment 
index encoder 493 of Figure 36, except that an additional multiplexer 675 is provided to force 
the segment index 654 to zero if the binary CAM flag 606 is not asserted. Thus, if no match is 
found within the binary CAM during a NFA operation, any subsequent insertion into the 
memory array will occur within a completely unfilled row and therefore will be directed to a 
predetermined segment of the row (in this example, the segment indicated by segment address 
00b). Note that, in an alternate embodiment, the binary CAM flag 606 may be inverted, then 
logically ORed with index select signal IS\ to achieve the same result achieved by the addition of 
multiplexer 675. 

[0118] Figure 48 is a flow diagram of a NFA operation within a hash CAM device that 
includes N hash CAM blocks each according to Figure 41 . At 691 1 -69 In, concurrent NFA 
operations are performed within each of the N hash CAM blocks. In one embodiment, a NFA 
operation is also performed within the overflow CAM block at 691 N +i (i.e., concurrently with 
operations 691 i-691n) to determine whether the overflow CAM block includes any storage rows 
that are at least partly unoccupied and that have been tagged with an entry type that matches the 
class code specified by the NFA instruction. If such a storage row is detected, the overflow 
CAM outputs a predetermined priority value and asserts its block flag. This operation of the 
overflow CAM is discussed below in greater detail. 

[0119] Still referring to Figure 48, at 693 the block flags are priority encoded according to 
their corresponding block priority values to generate a block identifier (BID). The block index 
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that corresponds to the block identifier is selected as the block index component of the device 
index (i.e., BINsel). At 695, the block identifier and selected block index are output as the 
device index (DIN), and the device flag is generated, for example, by ORing the block flags from 
the hash CAM blocks and the overflow CAM block. At 697, the device flag is evaluated to 
determine whether a CAM block (hash CAM block or overflow CAM block) within the CAM 
device is able to store the entry candidate. If the device flag is not asserted, the hash CAM 
device is unable to store the entry (699). If the device flag is asserted, the entry is inserted at the 
CAM block and memory location indicated by the device index (701). The entry may be 
inserted at the indicated location without further host instruction (i.e., CAM device automatically 
inserts the entry in response to assertion of the device flag), or, alternatively, in response to a 
separate insert instruction from a host device. 

[0120] Figure 49 is a flow diagram of a NFA operation within an one of the hash CAM 
blocks of Figure 41, i being any integer value from 1 to N. At 709, the class code of the NFA 
instruction (CC) is compared with the entry type of the CAM block (ETi), and the mask code of 
instruction (MC) is compared with the key mask of the CAM block (KM;) . If the class code 
does not match the entry type or the mask code does not match the key mask, the block flag is 
deasserted at 727 and the NFA operation within the hash CAM block is concluded. If the class 
code matches the entry type and the mask code matches the key mask, a key is generated at 71 1 
and used to generate a hash index (Hlj) at 713. At 715, the binary CAM is searched for a match 
with the hash index generated at 713. If a matching hash index is detected within the binary 
CAM at 717, the corresponding row of the memory array is output to the output logic to 
determine whether the row is full (719). If the row is not full, a partially filled row has been 
detected. Accordingly, at 723, the binary CAM fill count is output as the block priority value 
(the partial fill bit being cleared to 0), the block flag is asserted, and the binary CAM index 
(BCIN) and segment index (i.e., address of an unoccupied segment within the partially filled 



Atty. Docket No. NLMI.P1 12C 



-71 - 



row) are output as the block index. If the row is full, the hash CAM block is conflicted and the 
block flag is deasserted at 727. 

[0121] If a match is not detected within the binary CAM (717), then at 721 a buffered binary 
CAM flag (stored during the most recent binary CAM load operation) is inspected to determine 
whether the binary CAM is full. If the binary CAM is full, the block flag is deasserted at 727. If 
the binary CAM is not full, then at 735 the binary CAM fill count and a logic high partial fill bit 
(indicated by the summation of 200 hex and the fill count in the example of Figure 49) are 
output as the block priority value, the block flag is asserted, and the next free binary CAM 
address (NFB A) and predetermined segment index (00 in this example) are output as the block 
index. 

[0122] Figure 50 is a flow diagram of an insert operation within hash CAM device that 
includes multiple hash CAM blocks each according to Figure 41 . At 741 , the block ID 
component of a device index generated during the NFA operation of Figure 48 is compared with 
the block ID of the overflow CAM (BIDofc) to determine whether the entry candidate is to be 
inserted within the overflow CAM. If so, then at 743 the entry is stored in the CAM array of the 
overflow CAM at the row and segment indicated by the block index component (BIN S el) of the 
NFA-generated device index. If the block ID does not match the overflow CAM block ID, then 
the block ID corresponds to a hash CAM block ID, and the binary CAM flag of the hash 
indicated hash CAM block is evaluated at 745. If the binary CAM flag is set, then a matching 
hash index was located within the binary CAM and, at 747, the entry is stored in the hash CAM 
block memory at the row and segment indicated by block index component of the NFA- 
generated device index. If the binary CAM flag is not set, then no matching hash index was 
located within the binary CAM of the indicated hash CAM block. Accordingly, at 749, the 
NFA-generated hash index is stored at the next free address of the binary CAM (NFB A) of the 
indicated hash CAM block; at 75 1, the entry is stored in the hash CAM block memory at a 
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predetermined segment (e.g., segment 1) of the row indicated by the next free binary CAM 
address; at 753, the next free binary CAM address register is updated with the binary CAM index 
generated during the binary CAM load (that is the address of the next free binary CAM location); 
and at 755, the row fill count is incremented. 

[0123] Figure 51 is a flow diagram of a search operation within an i th one of the hash CAM 
blocks of Figure 41, i being any integer value from 1 to N (note that a device level search may be 
performed as described in reference to Figure 23). At 761, the class code of the search 
instruction (CC) is compared with the entry type of the CAM block (ETO . If the class code does 
not match the entry type, the block flag is deasserted at 777 and the search operation within the 
hash CAM block is concluded. If the class code matches the entry type, a key is generated at 
763 and used to generate a hash index (HIi) at 765. At 767, the binary CAM is searched for a 
match with the hash index generated at 765. If a matching hash index is detected within the 
binary CAM at 769, the contents of the corresponding row of the memory array are output to the 
output logic for comparison with the search key (771). If, at 773, the row of the memory array is 
determined to contain a valid entry that matches the search key, then a match has been detected. 
Accordingly, at 775, the block flag for the hash CAM device is asserted (e.g., BFi=l), the binary 
CAM index and segment address of the matching entry (SAme) are output as the block index 
(BINi), and the priority value stored within the matching entry (PRIme) is output as the block 
priority (BPi). If row does not contain a valid entry that matches the search key, then the block 
flag is deasserted at 777 to indicate that no match was detected. 

Overflow CAM having Search-Based NFA Function 
[0124] Figure 52 illustrates an embodiment of an overflow CAM block 800 having a 
programmable data storage width and a programmable priority function. The overflow CAM 
block 800 includes a CAM array 801, address logic 811, mask logic 813, priority index table 
803, priority encoder 805, read/write circuit 815, match flag logic 807, index select logic 821, 
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priority select logic 809 and block configuration register 819. The configurable CAM array 
includes Y rows 822 1 -822 y of CAM cells with each row segmented into a tag (T) segment and 
four entry segments (S1-S4). More or fewer segments may be provided in alternative 
embodiments, and the tag segment may be omitted. Each of the CAM cells includes storage for 
at least one data bit, and a compare circuit to compare the data bit with a corresponding bit of a 
search key 210. The CAM cells may be binary CAM cells, ternary CAM cells (i.e., CAM cells 
that permit selective masking of mismatch conditions), range compare cells, or any combination 
of binary, ternary and/or range compare CAM cells. One or more of the row segments may also 
include a different number of CAM cells than others of the row segments. For example, in one 
embodiment, the tag segment includes four CAM cells to store three tag bits and a validity bit, 
while each of the entry segments, S1-S4, includes more than three CAM cells. 
[0125] The CAM array 801 can be configured into different width by depth configurations by 
programming an entry size value into the configuration register 819. The entry size information 
is output as a set of one or more signals 840 (CFG) to configuration dependent circuit blocks 
within the overflow CAM block 800. In the exemplary embodiment shown in Figure 52, each 
row of CAM cells within the CAM array 801 may be configured to store four data entries that 
each span a single entry segment (xl configuration), two data entries that each span two entry 
segments (x2 configuration), or a single entry that spans all four entry segments (x4 
configuration). In alternative embodiments, the overflow CAM block 800 may have more or 
fewer segments and/or more or fewer configuration options. Also, other types of configuration 
information may be programmed within configuration register 819 in alternative embodiments. 
[0126] When an entry is loaded into the CAM array 801, the address logic 81 1 decodes an 
address received via bus 230 to assert an indicated one of the word lines 830 (WLi-WL Y ), and 
the read/write circuit 815 drives an entry onto array bit lines 834 of the appropriate entry 
segment or segments (as discussed above, a segment address portion of the incoming address 
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may be used in conjunction with the configuration value to select one or more entry segments to 
be written). A priority value may be loaded into the priority index table 803 (e.g., by the 
operation of read/write circuit 815 to drive a priority value onto priority bit lines 836) 
concurrently with the loading of the corresponding entry within the CAM array 801 or at a 
different time. In the xl configuration, for example, entries stored within segments S1-S4 of a 
given row of the CAM array 801 correspond to priority values stored within priority storage 
circuits P1-P4 in the priority index table 803. In one embodiment, the priority values used in the 
xl, x2 and x4 configurations are the same size (i.e., have the same number of constituent bits). 
Consequently, in the xl embodiment, each of the four priority storage circuits P1-P4 is used to 
store a priority value that corresponds to an entry in a corresponding one of entry segments Sl- 
S4, while in the x2 configuration, only two of the priority storage circuits are used to store 
priority values (i.e., to correspond to respective entries that span two entry segments), and in the 
x4 configuration, only one of the priority storage circuits is used to store a priority value (i.e., to 
indicate the priority of a corresponding x4 entry). In the x2 and x4 configurations, the unused 
priority circuits may be disabled or loaded with null data (e.g., minimum priority values). In 
alternative embodiments, differently sized priority values may be used for different CAM array 
configurations, and all or a portion of one priority storage circuit may be concatenated with 
another priority storage circuit to enable storage of larger sized priority values. 
[0127] The priority values stored within the priority index table 803 indicate the relative 
priorities of corresponding entries within the CAM array 801 and may be assigned in ascending 
priority order or descending priority order. During a search operation, search key 210 is 
simultaneously compared with all the entries within the CAM array 801 to generate a set of 
match signals 810i-810y, each indicating whether an entry or entries within a corresponding row 
of the CAM array 801 matches the search key 210. The priority values within the priority index 
table 803 that correspond to key-matching entries within the CAM array 801 (i.e., match- 
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qualified priority values) are compared with one another to determine a highest priority match- 
qualified priority value. The highest priority match-qualified priority value is output from the 
priority index table 803 as a search priority value 832 (SP) and is received by the priority select 
logic 809. The priority select logic 809 selects, according to the state of NFA signal 212, either 
the search priority value 832 or an insert priority value to be output as the block priority value 
822. More specifically, during an NFA operation, when the NFA signal 212 is high, the priority 
select logic 809 outputs an insert priority value, and during a search operation, when the NFA 
signal is low, the priority select logic 809 outputs the search priority value 832. 
[0128] The priority index table additionally outputs a set of qualified match signals 812 to the 
priority encoder 805 and match flag logic 807. Each of the qualified match signals corresponds 
to a row of the priority index table 803 (and therefore to a row of CAM array 801) and is asserted 
if a match qualified priority value equal to the search priority value 832 is stored in the priority 
index table row. 

[0129] Figure 53 illustrates an embodiment of the priority index table 803 of Figure 52 (other 
embodiments may be used). The priority index table 803 includes a segmented priority number 
storage array 841 (referred to herein as a priority array), tag logic circuits 847i-847 Y , column 
priority logic 843, and row logic circuits 845 1 -845 Y . The priority index table 803 receives 
priority values from, and outputs priority values to read/write circuit 815 of Figure 52 (e.g., via 
bit lines not shown in Figure 53). During a search or insert operation in the CAM array, the 
priority index table 803 receives Y sets of match signals 810i-810 Y from the CAM array 801 and 
generates Y corresponding sets of qualified match signals 812i-812 Y in accordance an operation 
select signal, OPSEL (generated, for example, by the instruction decoder 302 of Figure 15) and 
the entry size information, CFG 840. 

[0130] Each of the Y tag logic circuits 847 receives a respective match signal 810 from a 
corresponding row of the CAM array. In one embodiment, each of the match signals 810 
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includes five component match signals MT and M1-M4, that correspond to the tag segment and 
entry segments S1-S4, respectively. Each of the tag match logic circuits outputs a respective set 
of four tag match signals, TM1-TM4, with each of the tag match signals being asserted if (1) a 
tag value stored in the tag segment matched a class code value associated with the search key 
(i.e., the class code value (CC) discussed above in reference to Figure 17), and (2) an entry 
stored in the corresponding entry segment (or, in the x2 and x4 configurations, in a group of two 
or four entry segments) matched the search key. The entry size information, CFG 840, is input 
to the tag logic circuits 847 1 -847 Y to qualify the detection of a match condition. 
[0131] In the embodiment of Figure 53, the priority array 841 includes Y rows of priority cells, 
each segmented into four priority storage circuits P1-P4. During a compare operation, each row 
of priority storage circuits receives a respective set of four tag match signals, TM1-TM4 from a 
corresponding tag logic circuit and outputs a corresponding set of four prioritized match signals 
844 (PM1-PM4). Each asserted tag match signal is used within the priority array to enable a 
corresponding priority storage circuit to participate in a priority value compare operation with 
other such enabled priority storage circuits within the same column of priority storage circuits 
(the enabled priority storage circuit and priority value stored therein being referred to herein as a 
match-selected priority storage circuit and match-qualified priority value, respectively). The 
priority value compare operation within each column of priority storage circuits (i.e., PI i-Pl Y , 
P2i-P2 Y , etc.) is referred to herein as a column priority comparison and produces a respective 
one of column priority values CP1-CP4 (i.e., the highest priority of the match-qualified priority 
values within the column) and results in assertion of a prioritized match signal 844 for each 
match-qualified priority value that is equal to the column priority value. As an example, if, 
during a compare operation, there is a match-qualified priority value within each of the four 
columns of priority storage circuits of the priority array 841, then four column priority values 
will be output from the priority array 841 to the column priority logic 843, and at least four 
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prioritized match signals will be asserted (i.e., at least one for each column of priority storage 
circuits). More than one prioritized match signal may be asserted for a given column if the 
column contains more than one match-qualified priority value equal to the column priority value, 
thus providing a potential source of multiple-match indications. 

[0132] Figure 54 shows priority index table 900 that is one embodiment of two priority storage 
circuits, 90 lj and 90 1 Y , of priority array 841 of Figure 53. Each priority storage circuit includes 
a chain of n priority cells, with each cell including a compare circuit 906, isolation circuit 904 
and storage element 871. Each compare circuit 906 is connected in a wired-OR configuration 
with the other compare circuits in its respective column by one of priority signal lines 908i-808 n . 
Each priority signal line may be pre-charged towards a power supply voltage (or any other 
predetermined voltage) by a pre-charge circuit 902. Each compare circuit 906 may be any digital 
or analog compare circuit that compares the priority value bit stored in the corresponding storage 
element 871 with the priority value bits stored in every other storage element 871 of the same 
column (i.e., bits 871ij-871 i, Y are compared; bits 871 2 ,i-871 2 ,y are compared, and so forth). 
Additionally, each compare circuit 906 monitors the comparison result of the more significant 
priority value bits through the logical states of match line segments 910. Match line segments 
910 are coupled between match lines that carry incoming tag match signals TM1 r TMl Y and 
match lines that carry outgoing prioritized match signals PM1 i-PMl Y by isolation circuits 904. 
The isolation circuits 904 isolate the comparison results generated for less significant priority bit 
locations from affecting the comparison results generated for more significant priority bit 
locations. The isolation circuits 904 may also work together with the comparison circuits 906 to 
control the state of the match line segments 910. 

[0133] The operation of priority index table 900 can be illustrated with an example shown in 
Figure 55. In this example, priority index table 900 comprises a 2x4 matrix of rows and 
columns. For other embodiments, any numbers of rows and columns can be used. Row one 
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stores priority number 0110 having the decimal equivalent of the number 6, and row two stores 
priority number 0101 having the decimal equivalent of the number 5. For this example, the 
corresponding row of the CAM array contains segment entries that result in assertion of tag 
match lines TMi and TM2. Also, for this example, the priority numbers are stored in ascending 
priority order such that 0101 is the more significant (i.e., "higher priority") priority number 
between 0101 and 0110. 

[0134] Compare circuits 906i,i-906 4 ,2 determine that 0101 is the more significant priority 
value, and cause PM1 2 to be asserted as follows. The most significant bit CP 1 4 is resolved first. 
When any memory element 871 stores a logic zero and the corresponding match line segment 
910 is asserted, the corresponding priority signal line 908 is discharged. Thus, each of compare 
circuits 906 4 ,2 and 906 4j i discharge signal line 908 4 such that CP1 4 is a logic zero. Additionally, 
compare circuit 9064,2 compares the state of priority signal line 9O84 with the priority number bit 
stored in 871 4,2, and determines that both have the same logic state. This causes compare circuit 
9064,2 not to affect the logical state of match line segment 910 3 ,2 such that match line segment 
9103,2 has the same logic state as match line segment 910 4 ^ (TMI 2). Similarly, compare circuit 

906 4 .1 compares the state of priority signal line 9O84 with the priority number bit stored in 871 4,1 
and determines that both have the same state. This causes compare circuit 9064,1 not to affect 
the logical state of match line segment 9 103,1 such that match line segment 9103,1 has the same 
logic state as match line segment 9104,i (TMli). 

[0135] The next most significant bit CPI3 is then resolved. Memory elements 871 that store a 
logic one do not discharge their corresponding priority signal lines 908. Since memory elements 

871 3.2 and 871 3,1 both store logic one states, signal line 9O83 remains pre-charged such that CP1 3 
is a logic one. Additionally, compare circuit 9063,2 compares the state of priority signal line 9O83 
with the priority number bit stored in 8713,2, and determines that both have the same logic state. 
This causes compare circuit 9063,2 not to affect the logical state of match line segment 910 2 ,2 
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such that match line segment 9102,2 has the same logic state as match line segment 9103,2. 
Similarly, compare circuit 9063,1 compares the state of priority signal line 9O83 with the priority 
number bit stored in 8713,1 and determines that both have the same logic state. This causes 
compare circuit 906 3j i to not affect the logical state of match line segment 910 2 ,i such that match 
line segment 910 2 ,i has the same logic state as match line segment 91 03,1. 
[0136] CP 1 2 is resolved next. Since memory element 87 1 2 ^ stores a logic zero and match line 
segment 910 2j 2 is asserted, compare circuit 906 2 ,2 discharges priority signal line 908 2 . This 
causes CP 1 2 to be a logic zero. Additionally, compare circuit 9062,2 compares the logic zero state 
of priority signal line 908 2 with the logic zero stored in 871 2,2 and allows match line segment 
9101,2 to have the same state as match line segment 9102,2. Compare circuit 9062,1, however, 
compares the logic zero on priority signal line 908 2 with the logic one stored in memory element 
871 2 ,i, and de-asserts match line segment 910i,i. When a match line segment is de-asserted, all 
subsequent compare circuits for that row will de-assert the remaining match line segments of the 
row such that the corresponding prioritized match signal PM1 1 will be de-asserted. When the 
prioritized match signal is de-asserted for a particular segment, this indicates that the most 
significant priority number is not stored in that segment. Additionally, when the remaining match 
line segments are de-asserted for a row, the compare circuits for that row do not discharge the 
remaining priority signal lines regardless of the logic states stored in the corresponding memory 
elements of that row. For example, compare circuit 906i,i does not discharge priority signal line 
908i even though memory element 871 ij stores a logic zero. Additionally, isolation circuits 
9044,1, 90431, and 904 2 ,i isolate the de-asserted match line segment 910i,i from match line 
segment 910 4 ,i, 910 3 ,i, and 910 2 ,i such that CP1 4 , CP1 3 , and CP1 2 are not affected by the de- 
assertion of match line segment 910i,i. 

[0137] Lastly, the least significant bit CP 1 1 is resolved. Compare circuit 906i, 2 alone 
determines CP1 1 since compare circuit 906 1,1 cannot discharge priority signal line 908 1. Since 
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memory element 871 1,2 stores a logic one and match line segment 91 is asserted, compare 
circuit 906 1,2 leaves priority signal line 908 1 pre-charged 5 and CP 1 1 is a logic one. Additionally, 
compare circuit 906 1,2 allows prioritized match signal PM1 2 to have the same state as match line 
segment 910^2. Since match line segment 910^2 is asserted, PMI2 will be asserted indicating 
that the most significant priority number is stored in that row. 

[0138] Thus, when the priority comparison is completed, bits CPI4-CPI 1 indicate that the most 
significant priority number stored in the priority index table is 0101, and prioritized match signal 
PMI2 is asserted to signal that 0101 is stored in row two. 

[0139] For the example described above with respect to Figure 55, the most significant priority 
number is the lowest number such that 0101 is the most significant number between 0101 and 
01 10. For another embodiment, the priority numbers are stored in descending priority order such 
that 01 10 is the most significant priority number between 0101 and 0110. 
[0140] Any circuits may be used for compare circuits 906 and/or isolation circuits 904 to 
implement the priority comparison illustrated above. Table 2 shows one example of a truth table 
for implementing each compare circuit 906, where X (column) and Y (row) are any integers. 
Other truth tables may be used (and corresponding logic generated accordingly) including those 
that logically complement one of more or the signals indicated in Table 2. Any logic or circuitry 
may be used to implement the truth table of Table 2. 
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Table 2 



[0141] Figure 56 shows one embodiment of a circuit 930, referred to herein as a priority logic 
element or priority cell, for implementing the truth table of Table 2. The priority cell 930 
includes compare circuit 933, isolation circuit 931, and memory element 872. Compare circuit 
933 is one embodiment of compare circuit 906, and isolation circuit 931 is one embodiment of 
isolation circuit 904. The priority cell 930 may be used to implement all the priority cells in the 
priority index table. 

[0142] Compare circuit 933 includes inverter 944, transistors 936 and 938 connected in series 
between priority signal line 908 and ground, and transistors 940 and 942 connected in series 
between match line segment 91 Om and ground. N-channel transistor 936 has its drain coupled to 
signal line 908, its gate coupled to match line segment 910i, and its source coupled to the drain of 
n-channel transistor 938. Transistor 938 has its gate coupled to receive the logical complement 
of the priority number bit (/D) stored in memory element 871, and its source coupled to ground. 
N-channel transistor 940 has its drain coupled to match line segment 910i_i, its gate coupled to 
signal line 908 via inverter 944, and its source coupled to the drain of N-channel transistor 942. 
Transistor 942 has its gate coupled to receive the priority number bit (D) stored in memory 
element 871, and its source coupled to ground. Any of transistors 936, 938, 940, and 942 can be 
replaced with other types of transistors and the logic adjusted accordingly. 
[0143] Isolation circuit 931 includes inverters 932 and 934. For alternative embodiments, only 
one inverter may be used and the logic of the next compare circuit adjusted accordingly. For 
other embodiments, other isolation circuits such as one or more AND, OR, or XOR logic gates 
or pass gates may be used. 

[0144] Reflecting on the operation of priority cell 930, it can be seen that at least two principal 
functions are performed. First, the compare circuit 933 pulls down the column priority signal 
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line 908 if (1) the incoming match line segment 910j is high (thereby indicating that priority cell 
930 is enabled to participate in the bitwise priority comparison with other priority cells in the 
same column), and (2) the priority number bit is the lowest valued priority number bit in the 
column (i.e., the priority number bit, D, is a 0). Second, the priority cell 930 selectively enables 
or disables the next less significant priority cell from affecting the bitwise priority comparison 
for its column by setting the state of the outgoing match line segment 910i-i. Specifically, the 
priority cell 930 pulls down the outgoing match line segment 9l%\ to disable the next less 
significant priority cell from affecting the bitwise priority comparison for the next less 
significant column if (1) the incoming match line segment 910i is low (i.e., causing 910j_i to be 
pulled down via isolation circuit 93 1) or (2) the column priority signal line 908 settles to a low 
state and the priority number bit, D, is a 1 (i.e., priority cell 930 is a loser of the column priority 
comparison). In a priority table having Y rows (priority cell 930 being at column i, row j), the 
above described functions may be described by the following Boolean expressions: 
/CPli = (910i, , * /Dy) + (910 ij2 * /DyO + .... + (910 ijY * /D ijY ) (1) 
/910 i .ij = /91Qy + (D u */CPl i ) (2) 
Substituting (1) for CPli in expression (2) yields: 

/910 M j = /910g + [Dy * {(910;, , * /Dy) + (91 * /D ij2 ) + .... + (910 i>Y * /D i>Y )}] 
Thus, if the incoming match line segment 910i is high and the priority number bit, D, is a 1, the 
outgoing match line segment 910m may not settle to its final state until all (or a significant 
number) of the priority cells within the same column have completed their comparisons of 
priority number bits and input match line segment states. Because of this potentially long 
settling time, it becomes possible for the outgoing match line segment 910i_i to temporarily be 
high, before ultimately being pulled low. Consequently, the next less significant priority cell 
within may be enabled to temporarily pull the next less significant column priority signal line 
low (i.e., CPli_i=0) before finally being disabled by the low going match line segment 910j_i. 
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While an array of the priority cells 930 is self-correcting and will ultimately output the correct 
set of priority number bits as the column priority value and will assert the proper prioritized 
match signals, the temporary transition of one or more priority cells to an incorrect state requires 
time to correct. That is, transistor states require time to be reversed, and signal lines require time 
to charge/discharge, thereby slowing the overall generation of prioritized match signals and 
determination of a column priority number. 

[0145] Figure 57 illustrates an alternate embodiment of a priority cell 945 that includes timing 
circuitry to prevent the priority cell 945 from affecting the state of its column priority signal line 
908 until the state of the incoming match line segment 910j has settled. The priority cell 945 
includes an isolation circuit 931, compare circuit 948, and memory element 871, all coupled in 
the manner described in reference to Figure 56 to match line segments 910i and 91 0m and 
column priority signal line 908. The compare circuit 948 is identical to the compare circuit 933 
except that a ripple control transistor 946 is coupled between the source of transistor 938 and 
ground. A gate terminal of ripple control transistor 946 is coupled to an enable signal line 941 
(ENi) so that, until a ripple enable signal is asserted on line 941 (i.e., driving line 941 high), 
compare circuit 948 is disabled from affecting the state of the column priority signal line 908. 
Thus, even if transistors 936 and 938 are switched on (and therefore poised to pull line 908 low), 
the compare circuit 948 will not affect the state of the column priority signal line 908 until the 
ripple enable signal is asserted on line 941. Because other priority cells within the same column 
also include a ripple control transistor 946 having a gate terminal coupled to line 941, the state of 
the column priority signal line 908 will remain unaffected (i.e., line 908 will remain precharged 
to a high logic level via circuit 902 in this example). Further, by providing separate enable 
signal lines 941 for each column of priority cells, and asserting respective ripple enable signals 
on the lines 941 at staggered times (i.e., one after another, progressing from most to least 
significant column), the priority index table can be made to generate a set of prioritized match 
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signals and a column priority value without any priority cells transiently entering an incorrect 
state. Overall, a priority index table implemented with priority cells 945 may exhibit faster 
operation than a priority array implemented using the priority cells 930 of Figure 56. Also, 
because the state of each column priority signal line 908 is settled before a less significant 
column priority signal line is affected, isolation circuit 931 may be omitted from priority cell 945 
(i.e., a single shared match line may be coupled to each of the priority cells in a given row). 
[0146] Figure 58 is a waveform diagram of four ripple enable signals, RE1-RE4, that may be 
used to control a priority index table containing four columns of priority cells according to 
Figure 57. As shown, ripple enable signal RE4 goes high at time Tl to enable the priority cells 
in the most significant column of priority cells to set the state of the most significant column 
priority signal line (i.e., pull down or not pull down line 908 of Figure 57) and to set the states of 
their respective row match lines (i.e., pull down or not pull down line 910 of Figure 57). After a 
predetermined time, T R , ripple enable signal RE3 goes high to enable the priority cells in the next 
less significant column to set the state of the next less significant column priority signal line and 
to set the states of their respective row match lines (i.e., if not already pulled down). Ripple 
enable signals RE2 and RE1 are similarly asserted at subsequent, staggered Tr intervals to 
enable the operation of their respective columns of priority cells. 

[0147] In one embodiment, Tr is a fixed value that corresponds to a time required for a column 
of priority cells to resolve the state of the column priority signal line (e.g., resolution time plus a 
tolerance value). Alternatively, T R may be a programmable value within the CAM device (e.g., 
stored in the configuration register 819 of Figure 53 or other volatile or nonvolatile storage) that 
may be set once (e.g., production time setting according to supply voltage level, empirically 
determined operating speed of the device (e.g., binning), customer specification, etc.), or may be 
established during device run-time (e.g., as part of device initialization or periodically to 
compensate for changes in operating conditions such as supply voltage and temperature). 
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[0148] A predetermined time after ripple enable signal RE1 has been asserted, Tc, the priority 
number comparison is completed and all the ripple enable signals may be deasserted in 
preparation for the next priority number comparison. In the example of Figure 58, each of the 
ripple enable signals is a pulse that is phase offset from the others of the pulses by time T R so 
that both the assertion and deassertion times of the ripple enable signals is staggered. Although 
T c is depicted in Figure 58 as being a longer time than T R , Tc may alternatively be equal to or 
shorter than T R . 

[0149] Referring again to Figure 53, the column priority logic 843 compares the column 
priority values received from the priority array 841 to generate a search priority value, SP, that is 
the highest priority one of the column priority values. The column priority logic 843 further 
generates a set of four segment enable signals 852 SE[4:1], each segment enable signal 852 
being asserted or deasserted according to whether a corresponding one of the four column 
priority storage circuits contains a priority value equal to the search priority value. Thus, in the 
embodiment of Figure 53, if only one column of priority storage circuits contains a priority value 
equal to the search priority value, then only one of the four segment enable signals 852 will be 
asserted. Conversely, if more than one column of priority storage circuits contains a priority 
value equal to the search priority value, then more than one of the four segment enable signals 
852 may be asserted. The row logic circuits 845 1 -845 Y receive respective sets of the prioritized 
match signals 844i-844 Y from the priority array 841 and output, according to the segment enable 
signals 852, respective sets of qualified match signals 812i-812 Y , each set of qualified match 
signals including four component match signals, QM1-QM4. In one embodiment, the column 
priority logic 843 is implemented in the same manner as the search priority logic 550 of Figure 
34, except that the segment flag signals and OR gates 541, 543, 545 and 547 are omitted (i.e., the 
column priority values are input directly to the comparators); the outputs of AND gates 555, 557, 
559 and 561 corresponding to the segment enable signals SE1-SE4. In alternative embodiments, 
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other circuits may be used to implement the column priority logic 843 including, without 
limitation, a priority array as described in reference to Figures 52-54 in which the priority values 
are supplied from priority array 843 rather than being self contained. 

[0150] Figure 59 illustrates an embodiment of the tag logic 847 of Figure 53. The tag logic 
847 includes AND gates 951 i-951 4 , 952, 953 and 954, and multiplexer bank 955. The tag logic 
847 receives a row match signal 810 from a corresponding row of the CAM array, the row match 
signal 810 including tag segment match signal (MT) and entry segment match signals (M1-M4) 
as discussed. The tag logic 847 includes four AND gates 951 1-95 1 4 each of which receives the 
tag segment match signal (MT) at a first input and a respective one of the entry segment match 
signals (M1-M4) at a second input. The output of each of the AND gates 951 1 -951 4 is output to 
a xl port of a respective one of the multiplexers 955 so that, when the entry size information 840 
(CFG) indicates a xl mode, the outputs of AND gates 951 1 -951 4 are selected to be the tag match 
signals TM1-TM4, respectively. Thus, in the xl configuration, tag match signal TM1 is asserted 
if the tag segment match signal (MT) (i.e., indicating that the tag matched a class code for the 
search key) and the entry segment 1 match signal (Ml) (i.e., the entry in segment 1 matched the 
search key) are both asserted. The tag match signals TM2, TM3 and TM4 are similarly asserted 
if the tag segment match signal (MT) is asserted and entry segment match signals M2, M3 and 
M4, respectively, are asserted. 

[0151] If the entry size information 840 indicates a x4 configuration, the output of AND gate 
952 is selected to drive each of the tag match signals TM1-TM4. AND gate 952 performs a 
logical AND of the outputs of all the AND gates 951 1 -951 4 and therefore outputs a logic high 
signal only if all the incoming segment match signals (MT and Ml -M4) are high. Thus, in the 
x4 condition, all the tag match signals TM1-TM4 are asserted (i.e., to a high logic state in this 
example) if (1) the stored tag matches the class code associated with the search key, and (2) a x4 
entry stored in segments 1-4 matches the search key (as indicated by assertion of signals Ml- 
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M4). If the stored tag does not match the class code, or any of the entry segment match signals 
indicate a mismatch condition, all the tag match signals TM1-TM4 are deasserted to indicate a 
mismatch. 

[0152] If the entry size information 840 indicates a x2 configuration, the output of AND gate 
953 is selected to drive tag match signals TM1 and TM2, and the output of AND gate 954 is 
selected to drive tag match signals TM3 and TM4. AND gate 953 performs a logical AND of 
the outputs of AND gates 951 1 and 951 2 and therefore outputs a logic high signal if (1) the stored 
tag matches the class code associated with the search key, and (2) a x2 entry stored in segments 1 
and 2 matches the search key (as indicated by assertion of signals Ml and M2). If the stored tag 
does not match the class code or if either of entry segment match signals Ml and M2 indicate a 
mismatch condition, tag match signals TM1 and TM2 are both deasserted to indicate a mismatch. 
AND gate 954 performs a logic AND of the outputs of AND gates 95 1 3 and 95 1 4 and therefore 
outputs a logic high signal if (1) the stored tag matches the class code associated with the search 
key, and (2) a x2 entry stored in segments 3 and 4 matches the search key (as indicated by 
assertion of signals M3 and M4). If the stored tag does not match the class code or if either of 
entry segment match signals Ml and M2 indicate a mismatch condition, tag match signals TM1 
and TM2 are both deasserted to indicate a mismatch. 

[0153] Note that for all entry size configurations, the tag match signals TM1 -TM4 are 
deasserted if the tag segment match signal is deasserted (indicating a mismatch between the 
stored tag and class code for the search value). In alternative embodiments, the tag segment may 
be used to qualify the entry segment match signals for only a set of one or more of the entry size 
configurations, with the set of configurations being fixed or programmable (e.g., by a 
configuration setting within register 819 of Figure 52). In another alternative embodiment, the 
match the tag segment may be omitted from the CAM array 801 and the tag logic 847 omitted 
from the priority index table 803. 
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[0154] Figure 60 illustrates an embodiment of the row logic 845 of Figure 53. As discussed 
above, the column priority logic 843 compares the column priority values (CP1-CP4) received 
from the priority array to generate a plurality of segment enable signals 852i-852 4 (SE1-SEZ), 
each segment enable signal 852 indicating whether the corresponding column priority value is 
equal to the search priority value 832 (SP) (i.e., a highest priority one of the column priority 
values). The row logic 845 includes AND gates 956i-956 4 to logically AND the prioritized 
match signals PM1-PM4 output from a given row of the priority array with the segment enable 
signals SE1-SE4. The output of each logic AND gate 956 is coupled to a first input port of a 
respective one of multiplexers 957i-957 4 , so that, when an operation select signal (OPSEL) 
indicates a search operation or NFA operation, the outputs of AND gates 956i-956 4 are selected 
to be the qualified match signals 812 (QM1-QM4). Thus, during a search or NFA operation, 
qualified match signal QM1 will be asserted if the corresponding prioritized match signal (PM1) 
is asserted and segment enable signal SE1 is asserted. That is, qualified match signal QM1 will 
be asserted if PM1 indicates that the corresponding priority value is equal to the column priority 
value for column 1 of the priority array, and SE1 indicates that the priority value is equal to the 
search priority 832 (i.e., the priority value is an intra-column and inter-column winner of priority 
comparisons). Qualified match signals QM2, QM3 and QM4 are similarly asserted if their 
corresponding priority values are intra-column and inter-column winners of priority 
comparisons. 

[0155] Still referring to Figure 60, if the operation select signal (OPSEL) indicates a write to 
the CAM array, then validity values VI -V4 are selected by multiplexers 95 7 1 -95 1* to be output 
as qualified match signals QM1-QM4 instead of the signals generated by AND gates 956. In one 
implementation, the validity values are active low signals which, if high, indicate that the 
corresponding entry segment within the CAM array does not have a valid entry stored therein. 
That is, the validity values may be interpreted as active high unoccupied (empty) signals. 
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Accordingly, when the validity values are selected to be output as the qualified match signals, 
QM1-QMZ, the qualified match signals effectively represent a set of empty flags for the CAM 
array. The empty flags may be used within the priority encoder 805 of Figure 52 to generate an 
index of a next free address within the CAM array 801. In one embodiment, the validity values 
are formed by one or more bits stored in the CAM array 801 within the corresponding entry 
segment. In an alternative embodiment, to facilitate circuit layout, validity storage circuits are 
provided both in the CAM array and in a location physically near the multiplexers 957. In the 
latter embodiment, the validity values stored in the validity storage circuits located near the 
multiplexers 957 mirror the values stored within the CAM array and are used to drive the 
qualified match signals when a write operation is selected. In an embodiment in which a 
predetermined entry value (e.g., code) is used to indicate the presence or absence of a valid entry 
within an entry segment, the multiplexers 957 may be omitted altogether. 
[0156] Referring again to Figure 52, the match flag logic 807 generates a block flag 824 (BF) 
for the overflow CAM device 800 according to the states of the qualified match signals 812 and 
the block configuration signal 840 (CFG). In one embodiment, for example, the match flag 
logic 807 asserts the block flag 824 in accordance with the following Boolean expression (the 
symbol indicates a logical OR operation and the symbol indicates a logical AND operation): 

BF = xl# [(QM1i+QM1 2 +.-. + QM1y) + (QM2i+QM2 2 +... + QM2y) + 
(QM3 1 + QM3 2 + . . . + QM3 Y ) + (QM4, + QM4 2 + . , . + QM4 Y ) ] + 

x2 • [ (QMli* QM2,) + (QM1 2 * QM2 2 ) + ... (QM1 Y # QM2 Y ) + 
(QM3i# QM4i) + (QM3 2 # QM4 2 ) + . . . (QM3 Y # QM4 Y ) ] + 

x4m [ (QMl,*QM2i#QM3i#QM4i) + (QMl 2 »QM2 2 #QM3 2 «QM4 2 ) + 
(QM1 Y • QM2 Y • QM3 Y • QM4 Y ) ] 
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[0157] Any logic circuit that produces the above Boolean result may be used to implement the 
block flag logic, and, in alternative embodiments, different Boolean expressions may be used. 
For example, in an embodiment in which qualified match signals QM2 and QM4 are ignored in 
x2 mode and qualified match signals QM2, QM3 and QM4 are ignored in x4 mode, the block 
flag signal may be asserted in accordance with the following Boolean expression: 

BF = xl« [(QMli + QMl 2 +... + QMl Y ) + (QM2, + QM2 2 + . . . + QM2 Y ) + 
(QM3, + QM3 2 + '. . . + QM3 Y ) + (QM4, + QM4 2 + . . . + QM4 Y ) ] + 

x2 • [ (QMli + QM1 2 + . . . + QM1 Y ) + (QM3 , + (QM3 2 + . . . + QM3 Y ) ] + 

x4# (QMl! + QMl 2 +... + QMl Y ) 
[0158] Still referring to Figure 52, the priority encoder 805 generates an address, referred to as 
an entry index 828 (EIN), that corresponds to the row and segment of the priority index table 
(and therefore to corresponding row and entry segment of the CAM array 801) indicated by an 
asserted qualified match signal 812. If more than one qualified match signal 812]-812 Y is 
asserted, hardwired tie resolution logic within the priority encoder 805 generates an index that 
corresponds to the lowest numbered row and entry segment for which a qualified match signal is 
asserted. Alternative tie resolution techniques may be used in alternative embodiments, 
including, but not limited to, providing logic to resolve ties in favor of higher numbered rows or 
providing configurable tie resolution logic to enable programmable selection between multiple 
different tie resolution policies. In either case, the entry index 828 generated by the priority 
encoder is provided to the index select logic 821 which selects, according to the state of the NFA 
signal 212 and a block flag signal 824, either the entry index 828 or a previously stored version 
of the entry index to be output as the block index 826 (BIN) for the overflow CAM 800. 
[0159] Figure 61 illustrates an embodiment of the priority encoder 805 of Figure 52 (other 
embodiments may be used). The priority encoder 805 includes OR gates 960i-960 Y , segment 
priority encoders 959i-959 Y , row priority encoder 961 and multiplexer 958. Each of the OR 
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gates 960i-960 Y receives a respective set of qualified match signals, QM1-QM4 (each set 
forming a respective one of qualified match signals 812i-812 Y ), and outputs a corresponding one 
of row match signals MRi-MR Y to the row priority encoder 961 . Thus, the row priority encoder 
961 receives Y row match signals (MR), each indicating whether any of the qualified match 
signals QM1-QM4 for the corresponding row are asserted. The row priority encoder 961 performs 
an encoding operation to generate a row index 962 (RIN) (i.e., row address) that corresponds to 
the row of the CAM array indicated by an asserted one of the row match signals MR. In one 
embodiment, if two or more of the row match signals MR are asserted (i.e., a multiple match 
condition), the row priority encoder 961 generates a row index 962 that corresponds to the lowest 
numbered one of the indicated rows (i.e., the row having the lowest physical address of the 
indicated rows). In alternative embodiments, different row prioritizing policies may be used to 
resolve multiple match conditions (e.g., selecting the row having the highest physical address). 
[0160] Each of the segment priority encoders 959i-959 Y also receives a respective set of 
qualified match signals QM1-QM4 and performs an encoding operation to generate a respective 
prioritized segment address PSA. In one embodiment, if two or more of qualified match signals 
QM1-QM4 are high, the segment priority encoder 959 generates a prioritized segment address 
that corresponds to the lowest numbered one of the qualified match signal (i.e., the qualified 
match signal that corresponds to the CAM array segment having the lowest physical address as 
compared with other segments within the same row). By this arrangement, the prioritized 
segment address generated in response to a match detection in the x4 configuration (i.e., all 
qualified match signals asserted) will indicate segment 1 (e.g., PSA = 00 binary), and the 
prioritized segment address generated in response to a match detection in the x2 condition (at 
least one pair of match signals asserted) will indicate segment 1 or segment 3 (e.g., PSA = 00 or 
1 0), as the case may be. Also, in the event of multiple matches within a single row in either the 
xl or x2 configuration, the address of the lowest numbered segment will be output as the 
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prioritized segment address. In alternative embodiments, different segment prioritizing policies 
may be used to generate prioritized segment addresses. 

[0161] The multiplexer 958 receives Y prioritized segment addresses (PSA r PSA Y ) from the 
segment priority encoders 959 r 949 Y and selects, according to the row index 962, one of the 
prioritized segment address to be output as a segment index 963 (SIN). Thus, the segment index 
963 is the prioritized segment address that corresponds to the row that yielded the row index 962. 
Together the row index 962 and segment index 963 form the block index 826 for the overflow 
CAM block. 

[0162] Referring again to Figure 52, the mask logic 813 is used to provide a global masking of 
the search key (e.g., masking of selective bits within the search key according to host-supplied or 
host-selected mask values). During an NFA operation, the NFA signal 212 is high and is used 
within the mask logic 813 to select a NFA mask to be applied to the incoming search key. 
Figure 62 illustrates the format of a NFA mask according to one embodiment (other 
embodiments may be used). The NFA mask includes a tag field (TAG) and four entry segment 
fields (S1-S4), with each entry segment field including a validity field (V) and a data field (DF). 
In the embodiment of Figure 62, the NFA mask is used during an NFA operation to mask the 
data field components of each entry segment field (as indicated by the shaded areas), and to 
unmask the tag field, and unmask the validity field components of each segment field. Thus, 
during an NFA operation, the class code associated with an entry candidate may be compared 
with the tag field within each row of the CAM array 801, and a predetermined logic value (e.g., 
logic 1) may compared with each of the four segment validity values within each row to learn 
whether the CAM array 801 contains a partially filled row that is tagged with the specified class 
code. That is, the tag segment match signal (MT of Figure 59) will be high for a given row if the 
corresponding tag segment contains a tag that matches the specified class code, and each of the 
entry segment match signals (M1-M4) will be high if the validity value within the corresponding 
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entry indicates an unoccupied status. Because the tag segment match signal is ANDed with the 
entry segment match signals to produce the tag match signals TM1-TM4, the tag match signals, 
if asserted, indicate an empty state for an entry segment of the CAM array for which the 
corresponding tag segment matches the class code. 

[0163] Referring to Figure 63, the priority select logic 809 includes a multiplexer 964 to select, 
according to the state of the NFA signal 212, either the search priority value 832 (SP) output 
from the priority index table 803 of Figure 52, or a predetermined priority value 970 to be output 
as the block priority value 822. In one embodiment, the predetermined priority value 970 is 
output during an NFA operation (i.e., when NFA signal 212 is high) to enable the overflow 
CAM to be selected for entry insertion should none of the hash CAM blocks output a higher 
priority insert priority value (i.e., as described above in reference to Figure 48). More 
specifically, in the exemplary embodiment of Figure 63, a priority value of 400 hex is output as 
the predetermined insert priority value for the overflow CAM so that, the overflow CAM will 
output a numerically higher (i.e., lower priority) priority value than a hash CAM block having a 
completely or partially unfilled row. In alternative embodiments, the overflow CAM block may 
output different insert priority values including, without limitation, a priority value indicative of 
a fill level of the overflow CAM block, a priority value indicative of whether all the entry 
segments within the row indicated by the block index are empty (or, conversely, that at least one 
of the entry segments is filled; a partially filled row status), a priority value that indicates both a 
partially filled row status and a block fill level, etc. 

[0164] Figure 64 illustrates an embodiment of the index select logic 821 of Figure 52 (other 
embodiments may be used). The index select logic includes a multiplexer 967, storage element 
965 and AND gate 969. The entry index 828 from the priority encoder is input to a first input 
port of the multiplexer 967 and also to a data input of the storage element 965. During an 
overflow CAM load operation the entry index 828 will be generated based on the validity values 
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for the CAM array entries (i.e., as represented by the qualified match signals) and therefore will 
represent the highest priority available location (i.e., next free address) within the overflow 
CAM. Because the NFA signal 212 will be low during the load operation, the logic AND gate 
969 will output a low signal to select the entry index 828 to be output as the block index 826 
(BIN). Also, the instruction decoder (not shown in Figure 64) will generate an overflow CAM 
write strobe signal 966 (OCWS) to strobe the entry index (i.e., next free overflow CAM address) 
into the storage element 965. During a search operation, the entry index 828 will be generated 
based on the qualified match signals and therefore will indicate the CAM array location 
containing the highest priority key-matching entry. Because the NFA signal 212 will be low 
during the search operation, the logic AND gate 969 will output a low signal to select the entry 
index 828 to be output as the block index 826. 

[0165] During an NFA operation, the NFA signal 212 is high, enabling AND gate 969 to 
select, according to the state of the block flag 824, either the entry index 828 or the next free 
overflow CAM address 968 (stored within storage element 965) to be output as the block index 
826. More specifically, if the block flag is high during a NFA operation (i.e., indicating an 
unoccupied entry segment(s) within a row containing a tag that matches the class code associated 
with the entry candidate) the inversion at the block flag input to the AND gate 969 causes the 
output of AND gate 969 to be low and therefore to select the entry index 828 (i.e., the index of 
an empty entry segment within the CAM array) to be output as the block index 826. If the block 
flag is low during a search operation, then no match was detected (indicating that no row 
contained both a tag that matched the class code and an empty entry segment), and the output of 
the logic AND gate 969 is high to select the next free overflow CAM address 968 to be output as 
the block index 826. Thus, during an NFA operation, an entry index 828 is output for a partially 
(or completely) empty row having a tag that matches the class code associated with the entry 
candidate, and the next free overflow CAM address 968 is output if there is no such row. 
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System Structure and Operation 
[0166] Figure 65 illustrates a system 980 that includes a host processor 975 (e.g., general 
purpose processor, digital signal processor, network processor, application-specific integrated 
circuit (ASIC), etc.), program store 979, CAM device 977 according to one of the embodiments 
described herein, and associated storage 111. The system device may be, for example, a network 
switch or router, or any other type of device in which the compare capability of the CAM device 
977 may be used. 

[0167] The host processor 975 executes program code stored within program store 979 (which 
may include one or more semiconductor storage devices or other storage media) and issues 
addresses, comparands, and instructions to the CAM device 977 via the address, data and 
instruction buses, respectively (i.e., ABUS 230, DBUS 140 and IBUS 200), and receives status 
and other information from the CAM device 1701 via a result bus (RBUS 990). In particular, the 
host processor 975 issues instructions to program or select the entry types, key masks, hash 
function selections, entry sizes, compare modes, fill policies and other programmable or 
selectable features of one or more CAM blocks within the CAM device 977, as discussed above. 
In the embodiment of Figure 64, flag signals 992 (e.g., device-level match flag, multiple match 
flag, full flag, etc.) are output directly to the host processor 975, however, the flag signals 992 (or 
a single flag signal) may alternatively or additionally be output to the host processor 975 via the 
result bus 990, for example in a status word. The device index 976 may be output to an 
associated storage (e.g., associated storage 111, which may be included within the same 
integrated circuit (IC) or IC package as the CAM device 977 and/or host processor 975) and/or to 
the host processor 975. The information output from the associated storage 1 1 1 (i.e., in response 
to the device index 976) may be provided to the host processor 975 (e.g., via path 978), or other 
processor or control device within the system 980. 



Atty. Docket No. NLMI.P1 12C 



-96- 



[0168] In alternative embodiments, one or more of the buses (e.g., ABUS, DBUS, IBUS, or 
REUS) may be omitted and the corresponding information time multiplexed onto another of the 
buses. Further, the CAM device 977 and host processor 975 may be implemented in distinct 
integrated circuits (ICs) and packaged in distinct IC packages, or in a single IC (e.g., in an ASIC, 
system-on-chip, etc.), or in an IC package that includes multiple ICs (e.g., a multi-chip package, 
paper thin package, etc.). Also, the system 980 may include multiple CAM devices 977 each 
programmed with a respective set of configuration values. For example, in one embodiment, 
system 980 includes multiple CAM devices according to hash CAM device 190 of Figure 6, 
along with an overflow CAM device. During a search or NFA operation, each of the CAM 
devices may be operated in parallel to generate a match address or insertion address. 
[0169] Numerous functions described as being implemented within the CAM devices of 
Figures. 3-64 may alternatively be implemented by the host processor 975 in response to 
instructions (i.e., software or firmware) stored within the program store 979. For example, in 
one embodiment the fill level of the binary CAM (or ternary CAM) within each hash CAM block 
is tracked by host processor 975 under program control. Each time a hash CAM block is 
selected for entry insertion at an unfilled row (an event that involves loading a new hash index 
into the binary CAM), the binary CAM fill count value for the hash CAM block is incremented. 
Similarly each time a binary CAM entry is deleted (e.g., due to deletion of an entry or entries in a 
corresponding row of the memory array), the binary CAM fill count value for the hash CAM 
block is decremented. Prior to selecting a hash CAM block for entry insertion at an unfilled row 
(a circumstance indicated by a device priority value having, for example, a logic high partial fill 
bit), the host processor inspects the binary CAM fill count value for the hash CAM block to 
determine if the count value has reached a maximum (i.e., completely full) level. If the count 
value indicates a full binary CAM, the host processor does not store the candidate entry in the 
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indicated hash CAM block. Other functions within the CAM devices of Figures 3-64 may 
similarly be implemented by the host processor 975 under program control. 
[0170] In the foregoing specification the invention has been described with reference to 
specific exemplary embodiments thereof. It will, however, be evident that various modifications 
and changes may be made thereto without departing from the broader spirit and scope of the 
invention. The specification and drawings are, accordingly, to be regarded in an illustrative 
rather than restrictive sense. 
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